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RECEPTORS AND ASSOCIATED PROTEINS 



TECHNICAL FIELD 

This invention relates to nucldc acid and amino acid sequences of recq>tors and associated 
5 protdns and to the use of these sequences in the diagnosis, treatment, and prevention of neurological 
disord^s; immunological disordors, including autoimmune/inflammatory disordo-s; and cell proliti^ative 
disordo's, including cancer. 

BACKGROUND OF THE INVENTION 

10 The term receptor describes a protdn that specifically recognizes other molecules. The category 

is broad and includes proteins with a variety of functions. The bulk of recq^tors are cell surface protdns 
which bind extracellular ligands and produce cellular responses in the areas of growth, differ^tiation, 
mdocytosis, and immune response. Other rec^tors facilitate the sdectiye transport of proteins out of the 
endoplasmic reticulum and localize enzymes to particular locations in the cell. Propagation of cdlular 

IS signals, and transport and localization of protons, all rely upon specific impactions between recq)tors 
and a variety of associated proteins. The t^m recq>tor may also be applied to proteins which bind to 
ligands with known or unknown ch^nical composition and which interact with other cdlular conqx)nents. 
For example, the steroid hormone recq)tors bind to and regulate transcription of DN A. 

Cell surface recq)tors are typically integral plasma membrane proteins. These recq)tors 

20 recognize hormones such as catecholamines; pq)tide hormones; growth and diffi^entiation factCH-s; small 
pq)tide factors such as thyrotropin-rdeasing hormone; galanin, somatostatin, and tachykinins; and 
circulatory system-bCH'ne signaling molecules. Cell surface receptors on immune system cells recognize 
antigens, antibodies, and major histoconq^atibility complex (MHC)-bound peptides. Other cell surface 
recq)tors bind ligands to be int^nalized by the cell. This recq>tor-mediated endocytosis functions in the 

25 uptake of low density lipoproteins (LDL), transferrin, glucose- or mannose-tarminal glycoproteins, 
galactose-tea-minal glycoproteins, immunoglobulins, phosphovitellogenins, fibrin, protdnase-inhibitor 
complexes, plasminogen activators, and thrombospondin (Lodish, H. et al. (1995) Molecular Cell 
Biology . Scientific American Books, New York NY, p. 723; and Mikhailenko, I. et al. (1997) J. Biol. 
Chem. 272:6784-6791). 

30 Signal transduction is the process of biochemical events by which cells are able to 

communicate with one another and respond to extracellular signals. Extracellular signals are 
transduced through a biochemical cascade that begins with the binding of a signal molecule to a cell 
membrane receptor. The signal is propagated to effector molecules by intracellular signal transducing 
proteins and culminates with the activation of an intracellular target molecule. The process of signal 

35 transduction regulates a wide variety of cell functions including cell proliferation, differentiation, and 
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gene transcription. 

G-protein Coupled Receptors (GPCRs) 

G-protein coupled recq>tors (GPCRs) are a class of molecules that participate in signal 

5 transduction in a variety of cell types. GPCRs are integral membrane proteins diaractoized by the 
presence of seven hydrophobic transmembrane domains which span the plasma membrane and form a 
bundle of antiparallel alpha (a) helices. These proteins range in size from under 400 to over 1000 amino 
acids (Strosberg, A.D. (1991) Eur. J. Biochent 196:1-10; Coughlin, S.R. (1994) Curr. Opin. Cell Biol. 
6:191-197). The amino-terminus of the GPCR is extracellular, of variable length and often glycosylated; 

10 the carboxy-terminus is cytoplasmic and generally phosphorylated. Extracdlular loops of the GPCR 
alternate with intracellular loops and link the transmembrane domains. The most conserved domains of 
GPCRs are the transmambrane domains and the first two cytoplasmic loops. The transmembrane 
domains account for structural and lunctional features of the recqjtor. In most cases, the bundle of a 
hdices forms a binding pocket. In addition, the extracdlular N-terminal segment or one or more of the 

15 three extracellular loops may also participate in ligand binding. Ligand binding activates the rec^tor by 
inducing a conformational change in intracellular portions of the recqptor. The activated receptor, in 
turn, inta-acts with an intracellular heto-otrimeric guanine nucleotide binding (G) protdn conq;)lex which 
mediates fiirtho* intracdlular signaling activities, generally the production of second messengers such as 
cyclic AMP (cAMP), phospholipase C, inositol triphosphate, or interactions with ion channel proteins. 

20 (Baldwin, J.M. (1994) Curr. Opin. Cell Biol. 6:180-190; Watson, S. and S. Arkinstall (1994) The G- 
protein Linked Receptor Facts Book . Academic Press, San Diego CA, pp. 2-6.) Hydrolysis of bound 
GTP by the G-protein completes the cycle, returning the G-protein to its inactive GDP-bound state. 

GPCRs include receptors for senscHy signal mediators (e.g., light and olfactory stimulatory 
molecules); adenosine, bombesin, bradykinin, endothelin, y-aminobutyric acid (GAB A), hq)atocyte 

25 growth factor, lutdnizing hc»-mone (LH), thrombin, thyroid stimulating hormone (TSH), melanocortins, 
neurop^tide Y, opioid peptides, opsins, somatostatin, tachykinins, vasoactive intestinal polypeptide 
family, and vasopressin; biogenic amines (e.g., dopamine, q>inq)hrine and norepinephrine, histamine, 
glutamate (metabotropic effect), acetylcholine (muscarinic effea), and serotonin); chemokines; lipid 
mediators of inflammation (e.g., prostaglandins and prostanoids, platelet activating factor, and 

30 leukotrienes); and pq^tide hormones (e.g., calcitonin, C5a anaphylatoxin, follicle-stimulating hormone 
(FSH), gonadotropic-rdeasing hormone (GnRH), neurokinin, and thyrotropin-releasing hormone (TRH), 
and oxytocin). GPCRs which act as recq)tors for stimuli that have yet to be idoitified are known as 
orphan recqitors. For example, the TPRA40 protein is a GPCR isolated from mouse adipocytes and 
present in a numba- of mouse and human tissues, whose expression in adipose tissue is altered with aging 

35 and type 2 diabetes (Yang, H. (1999) Endocrinology 140:2859-2867). 
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GPCR mutations, which may cause loss of function (m* constitutive activation, have been 
associated with num^ous human diseases (Coughlin, supraV For instance, rednitis pigmentosa may 
arise from mutadons in the rhodopsin gene. Rhodopsin is the retinal photorecq)tor which is located 
within the discs of the eye rod cdl. Parma, J. et al. (1993, Nature 365:649-651) rqjort that somatic 
5 activating mutations in the thyrotropin receptor cause hyperfunctioning thyroid adenomas and suggest 
that certain GPCRs suscq^tible to constitutive activation may behave as protooncogenes. Elevated levels 
of TSH recGptor have been observed in brain tissue from Down syndrome and Alzheim^'s disease 
patients, suggesting an apoptotic role for this recq)tor in neurodegraerative disorders (Labudova, O. et 
al. (1999) Life Sci. 64:1037-1044). Many clinically relevant drugs act on GPCRs, including a and p 
10 blockers which affect the activity of adrenergic receptors and are used in the treatment of hypmension 
and other cardiovascular discwders (WaLson, supra , pp. 32-33). 



Receptors Involved in the Immune Svstem 

Examples of GPCRs implicated in inflanmiation and the immune response include the EGF 
15 module-containing, mucin-like hormone receptor (Emrl) and CD97 receptor proteins. These GPCRs 
are members of the recentiy characterized EGF-TM7 receptors family. These seven transmembrane 
hormone receptors exist as heterodimers in vivo and contain between three and seven potential 
calcium-binding EGF-like motifs. CD97 is predominantly expressed in leukocytes, and is markedly 
upregulated on activated B and T cells. (McKnight, A. J. and Gordon, S. (1998) J. Leukoc. Biol. 
20 63:271-280.) 

Irregularities in the GPCR signaling cascade may result in abnormal activation of leukocytes and 
lymphocytes, leading to the tissue damage and destruction seen in many inflammatory and autoimmune 
diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic anemia, lupus oTthematosus, and 
thyroiditis. Abnormal cell prolifa'ation, including cyclic AMP stimulation of brain, thyroid, adrenal, and 
25 gonadal tissue proliferation is regulated by G proteins (Mdj, J.T.A. (1996) Mol. Cell. Biochem. 157:31- 
38; Aussel, C. et al. (1988) J. Immunol. 140:215-220). 

T cells play a dual role in the immune system as effectors and regulators, coupling antigen 
recognition with the transmission of signals that induce cell death in infected cells and stimulate other 
immune cells. Although T cells collectively recognize a wide range of different antigens, a clonal line of 
30 T cdls can only recognize a single antigen. Moreover, the antigen must be presented to the T cell 

receptor (TCR) as a peptide complexed with a major histocompatibility molecule (MHC) on the surface 
of an antigen-presenting cell. The TCR on most T cells consists of two polypq)tide subunits, a and p, 
which are inununoglobulin-like integral membrane glycoprotdns of similar molecular wdght. The TCRa 
and TCRp subunits have an ^tracellular domain containing both variable and constant regions, a 
35 transmembrane domain that traverses the m^brane once, and a short intracellular domain (Saito, H. et 
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al. (1984) Nature 309:757-762). The genes for the TCR subunits are constructed through somatic 
rearrangement of diffident gene segments. Interaction of antigen in the proper MHC context with the 
TCR initiates signaling cascades that induce the proliferation, maturation, and function of cdlular 
componaits of the immime system (Weiss, A. (1991) Annu. Rev. Genet. 25: 487-510). Rearrangements 
in TCR genes and alterations in TCR expression have been noted in lymphomas, leukemias, autoimmune 
disord^s, and immunodeficiency disorders (Aisenberg, A.C. et al. (1985) N. Engl. J. Med. 313:529-533; 
Olive, C. (1995) Immunol. Cell. Biol. 73:297-307; and Weiss, supra) . Immunizations withpq^tides 
d^ived from TCRs are effective treatment for some human T-cdl-mediated autoimmune disease and in 
animal models of such illnesses, in particular, rheumatoid arthritis (Bridges, S.L. and Mordand, L.W. 
(1998) Rheum. Dis. CUn. North Am. 24:641-650). 

Tumor necrosis factor (TNF) is a pleiotropic cytokine that mediates immune relation and 
inflammatory responses. The cellular responses triggered by TNF are initiated through its interaction 
with two distinct cell surface recq^tors, TNF-Rl and TNF-R2. (Tartaglia, L.A. and Goeddel, D. V. 
(1992) Immunol. Today 13:151-153). Both TNF recq>tQrs are part of the TNF receptor (TNFR) 
superfamily, whose members include the Fas antigen, the p75 subunit of the NGF receptor, the TRAIL 
recq)tor, TRUNND, SalF19R. CD27, CD30, and CD40. Members of the TNFR sup^family share the 
TNFR/NGFR family cysteine-rich region signature, which consists of cystane-rich pseudo-repeats in the 
extracellular domains. (ExPASy PROSITE document PDOC00561 ; Pan, G, &i al. (1998) FEBS Lett. 
424:41-45; Bairoch, A. et al. (1997) Nucldc Acids Res. 25:217-221; and Smith, C.A. et al. (1994) Ceil 
76:959-962), Polymorphisms in TNF-R2 are associated with systemic lupus erythematosus (Komata, T. 
et al. (1999) Tissue Antigens 53:527-533). In addition, increased serum concentrations of soluble TNF- 
Rl have been observed in some patients with advanced gastric or col^ectal cancer (Shibata, M. et al. 
(1998) Surg. Today 28:884-888). 

Another essential component of the immune response is the conq:)lenirat systm, which responds 
to signals provided by antigen recognition by mobilizing effector activities including inflammation, 
phagocytosis, and cell lysis. Recq^tors on macrcphages and neutrophils bind activated complement C3 
on the surface of foreign particles such as bactma, thus targeting the fcH'eign particles for phagocytosis 
and destruction by lysosomal enzymes. Complement recq>tor 1 (CRl) has a wide cellular/tissue 
distribution, and mediates enhancement of phagocytosis, induction of IL-1 secretion and enhancement of 
B-cdl diff^entiation. Defective expression of CRl is associated with the autoimmune disease systemic 
lupus erythematosis. (Carroll, M.C. (1998) Annu, Rev. Immunol. 16:545-568.) 

Nuclear Receptors 

The nuclear receptors are another recq>tor family, and includes the retinoic acid receptors 
(R ARs) and the retinoid X recq)tCH's (RXRs). RARs and RXRs can form heterodima-s which are 
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thought to have a signal transduction function. Retinoic acid (R A) is a biologically active metabolite of 
vitamin A (retinol), a fat-soluble vitamin found mainly in fish liv^ oils, liver, egg yolk, butter, and 
cream, Retinol cannot be synthesized in vivo and must be obtained from the diet. R^nol, RA, and other 
retinoids influence epithelial cell differentiation. A numb^ of carrier protdns which bind retinol or oth^ 
5 retinoids have been identified. These retinoid binding protdns (RBPs) appear to direct bound retinoid 
molecules to specific metabolic pathways. Specific recq)tors fca: RBPs mediate the cellular uptake of 
retinoids and the transfer of retinoids to intracellular RBPs (Sundaram, M. et al. (1999) J. Biol. Chan. 
273:3336-3342). 



O 

m 



10 Low Molecular Wdeht (LMW) G^protdns 

Low molecular weight (LMW) G-proteins regulate cell growth, c^l cycle control, protdn 
secretion, and intracellular vesicle intaraction. They consist of single polypq^tides which are able to bind 
1^ to and hydrolyze OTP, thus cycling betwe^ an inactive and an active state. LMW G-protdns respond to 

Q extracellular signals from receptors and activating proteins by transducing mitogenic signals involved in 

fyj 15 various cell fiinctions. The binding and hydrolysis of GTP regulates the response of LMW G-proteins 
J and acts as an energy source during this process (Bokoch, G. M. and Dot, C. J. (1993) FASEB J. 

O 7:750-759). 

J At least sucty members of thQ LMW G-protein superfamily have been identified and are currently 

grouped into the ras, rho, arf, sari, ran, and rab subfamilies. Activated ras genes were initially found in 
20 human cancm and subsequent studies confirmed that ras function is critical to receptor tyrosine kinase- 
mediated signal transduction pathways that deto-mine whether c^ls continue to grow and divide, or 
pj whether they differentiate. Rho G-protans control signal transduction pathways that link growth factor 

receptors to actin polymoization, which is necessary for normal cellular growth and division. The rab, 
arf, and sari families of protdns control the translocation of vesicles to and from membranes for protein 
25 localization, protein processing, and secretion. Ran G-protdns are located in the nucleus and have a key 
role in nuclear protdn impcat, the control of DNA synthesis, and cdl-cycle progression (Hall, A. (1990) 
Science 249:635-640; Barbacid, M. (1987) Ann. Rev Biochem. 56:779-827; and Sasaki, T. and Takai. 
Y. (1998) Biochan. Biophys. Res. Commua 245:641-645). 

LMW G-proteins are GTPases which cycle between the active GTP-bound and inactive 
30 GDP-bound forms. At least three types of proteins regulate this process: GTPase-activating proteins, 
(GAP), which stimulate GTP hydrolysis by the LMW G-protein; guanine nucleotide exchange factors 
(GEP), which facilitate the exchange of GDP bound to the LMW G-protein for GTP; and guanine 
nucleotide dissociation inhibitors (GDI), which inhibit this reaction (Ikeda, M. et al. (1998) J. Biol. 
Chem. 273:814-821 ; (Juilliam, L. A. (1995) Bioessays 17:395-404). The best characterized GEP is the 
35 mammalian homologue of the Drosophila Son-of-Sevenless protein. Both GEP and GAP activity may be 

5 
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affected by extracdlular stimuli and modified by accessory proteins such as RalBPl and POBl. Mutant 
Ras-family proteins, which bind but can not hydrolyze GTP. are permanently activated, and cause cell 
proliferation or cancer, as do GEP that activate LMW G-proteans (Drivas, G. T. et al. (1990) Mol. Cell. 
Biol. 10:1793-1798; and Whitdiead, 1. P. et al. (1998) Mol Cdl Biol. 18:4689-4697). 

5 

Olfactory GPCRs 

Another large subfamily of GPCRs are the olfactory recqptors. These receptors share the seven 
hydrq^hobic transmembrane domains of othex GPCRs and function by registCTing G protein-mediated 
transduction of odorant signals. Numerous distinct olfactory receptws are required to distinguish 
10 diffident odors. Each olfactory sensory neuron expresses only one type of olfactory recq)tor, and distinct 
spatial zones of neurons expressing distinct receptors are found in nasal pasages. 

The discovCTy of new receptors and associated proteins and the polynucleotides ^coding them 
satisfies a need in the art by providing new conq)OSitions which are useful in the diagnosis, prevention, 
and treatm^ of cell prolifo-ative, autoimmune/inflammatory, and neurological disorders. 

15 

SUMMARY OF THE INVENTION 

The invration features purified polypeptides, recq)tors and associated protdns, referred to 
collecUvely as "RECAP" and individually as "RECAP-1," "RECAP-2." "RECAP-3." "RECAP-4," 
"RECAP-5/' "RECAP-6/' "RECAP-7," "RECAP-8," "RECAP-9," "RECAP-10." "RECAP-ll." 

20 "RECAP-12/* "RECAP-13," "RECAP-14," "RECAP-15/* "RECAP-16," "RECAP-17/* "RECAP-18/' 
"RECAP-19." "RECAP-20," "RECAP-21," and "RECAP-22," In one aspect, the invention provides an 
isolated polypeptide comprising an amino add sequence selected fi'om the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a naturally occurring 
amino acid sequence having at least 90% sequOTce identity to an amino add sequence sdected fi-om the 

25 group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence 
sdected from the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of an amino 
acid sequ^ce selected fi-om the group consisting of SEQ ID NO: 1-22. In one alternative, the invwition 
provides an isolated polyp^tide comprising the amino acid sequence of SEQ ID NO: 1-22. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 

30 amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the 
group consisting of SEQ ID NO:l -22, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence sdected firom the group consisting of SEQ ID NO: 1-22, c) a 
biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-22, and d) an inununogenic fragment of an amino acid sequence selected from the group consisting 

35 of SEQ ID NO: 1-22. In one alternative, the polynucleotide encodes a polyp^tide sdected from the 

6 
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group consisting of SEQ ID NO: 1-22. In another altCTnative, the polynucleotide is selected from the 
group consisting of SEQ ID NO:23-44. 

Additionally, the invention provides a recombinant polynucleotide comprising a promote 
sequence opo-ably linked to a polynucleotide encoding a polypeptide comprising an amino acid sequence 

5 selected from the group consisting of a) an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-22, b) a naturally occurring amino acid sequence having at least 90% sequence identity to 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active 
fragmm of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, and d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:l - 

10 22, In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. In 
another alternative, the invration provides a transgenic organism comprising the recombinant 
polynucleotide. 

H The invention also provides a method for producing a polypeptide comprising an amino acid 

^ sequmce seleaed from the group consisting of a) an amino acid sequ^ice selected from the group 

W 15 consisting of SEQ ID NO: 1 -22, b) a naturally occurring amino acid sequence having at least 90% 
03 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l-22, c) a 

5 biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 

s NO: 1-22, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

^ of SEQ ID NO: 1-22. The m^od con^rises a) culturing a cell unda: conditions suitable for expression 

H= 20 of the polypeptide, wherdn said cell is transformed with a recombinant polynucleotide comprising a 
pi promotar sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering the 

polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
25 sequence selected from the group coasisting of SEQ ID NO:l -22. b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22. 
30 The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a) a polynucleotide sequence selected from the group coasisting of 
SEQ ID NO: 23-44, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
35 an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
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Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
5 SEQ ID NO:23-44, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, c) a 
polynucleotide sequrace complementary to a), d) a polynucleotide sequence complrairatary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising at 
least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in tiie 
10 sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragmrats thoreof, 
and b) detecting the presence or absence of said hybridization complex, and optionally, if present, the 
amount thereof. In one alternative, the probe comprises at least 60 contiguous nucleotides. 

The invention furthCT provides a method for detecting a target polynucleotide in a sample, said 
W 15 target polynucleotide having a sequaice of a polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
5 SEQ ID NO:23-44, b) a naturally occurring polynucleotide sequmce having at least 70% sequmce 

12 identity to a polynucleotide sequence sheeted from the group consisting of SEQ ID NO:23-44, c) a 

2 polynucleotide sequ^ice complCTimtary to a), d) a polynucleotide sequence complementary to b), and e) 

20 an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or fragment 
S thereof using polymerase chain reaction amplification, and b) detecting the presence or absence of said 

Hi anq>lified target polynucleotide or fragment th^eof , and, optionally, if present, the amount thareof. 

The invaition further provides a pharmaceutical composition comprising an effective amount of 
a polypeptide con^rising an amino acid sequence selected from the group consisting of a) an amino acid 
25 sequMce selected from the group consisting of SEQ ID NO: 1 -22, b) a naturally occurring amino add 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence sdected from 
the group consisting of SEQ ID NO:l -22, and d) an immunogenic fragment of an amino add sequence 
sdeaed from the group consisting of SEQ ID NO: I -22, and a pharmaceutically acceptable excipient In 
30 one embodiment, the pharmaceutical composition comprises an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1 -22. The invention additionally provides a method of treating a disease 
or condition associated with decreased expression of functional RECAP, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 

The invOTtion also provides a method for screening a compound for effectiveness as an agonist 
35 of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 

8 
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acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of an amino acid sequence 
5 selected from the group consisting of SEQ ID NO: 1-22. The method comprises a) exposing a sample 
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. In one 
altmiative, the invention provides a pharmaceutical composition comprising an agonist compound 
identified by the method and a pharmaceutically acceptable excipient. In another alt^native, the 
invration provides a method of treating a disease or condition associated with decreased expression of 
10 functional RECAP, comprising administering to a patient in need of such treatment the pharmaceutical 
composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as an 
antagonist of a polypeptide comprising an amino add sequence sheeted from the group consisting of a) 
an amino acid sequ^ce selected from the group consisting of SEQ ID NO: 1-22, b) a naturally occurring 

15 amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of an amino 
acid sequence sd^ected from the group consisting of SEQ ID NO:l -22. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 

20 the sample. In one alternative, the inv^tion provides a pharmaceutical composition comprising an 
antagonist compound identified by the method and a pharmaceutically acceptable excipient. In another 
altornative, the invention provides a method of treating a disease or condition associated with 
overexpression of functional RECAP, comprising administ^ing to a patient in need of such treatment 
the pharmaceutical composition. 

25 The invention furth^ provides a method of screening for a compound that specifically binds to 

a polypeptide comprising an amino add sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-22, b) a naturally occurring amino acid 
sequence having at least 90% sequ^ce identity to an amino add sequence sdected from the group 
consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence selected from 

30 the group consisting of SEQ ID NO:l -22, and d) an immunogenic fragment of an amino add sequence 
sdected from the group consisting of SEQ ID NO: 1-22. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
polypeptide to the test compound, th^eby identifying a compound that specifically binds to the 
polypeptide. 

35 The invention further provides a method of screening for a compound that modulates the 

9 
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activity of a polyp^tide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino 

5 acid sequence selected from the group consisting of SEQ ID NO: 1 -22, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -22. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 

10 test compound with the activity of the polypeptide in the absence of the test compound, wherein a 
change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in altering 
expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence selected 

15 from the group consisting of SEQ ID NO:23-44, the method comprising a) exposing a sample 

comprising the target polynucleotide to a compound, and b) detecUng alt^ed expression of the target 
polynucleotide. 

The invention ftirthar provides a method for assessing toxicity of a test compound, said method 
comprising a) treating a biological sample containing nucleic acids with the test compound; b) 

20 hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:23- 
44, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, iii) a polynucleotide 

25 sequence complementary to i), iv) a polynucleotide sequence complementary to u), and v) an RNA 
equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex is 
formed between said probe and a target polynucleotide in the biological sample, said target 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:23-44, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 

30 polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44. iii) a polynucleotide 
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of the above 
polynucleotide sequence; c) quantifying the amount of hybridization complex; and d) comparing the 
amount of hybridization complex in the treated biological sample with the amount of hybridization 

35 complex in an untreated biological sample, wherein a difference in the amount of hybridization 

10 



SUBSTITUTE SHEET (RULE 26) 



wo 01/07612 



PCT/USOO/20035 



complex in the treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 
Table 1 shows polypeptide and nucleotide sequence identification numb^s (SEQ ID NOs), clone 
5 identification numbCTs (clone IDs), cDNA libraries, and cDN A fragments used to assemble full-l^gth 
sequences encoding RECAP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of RECAP. 

Table 3 shows selected fi-agments of each nucleic acid sequence; the tissue-specific expression 
10 patterns of each nucldc acid sequence as determined by northern analysis; diseases, disordars, or 
conditions associated with these tissues; and the vectcn- into which each cDNA was cloned. 

Table 4 describes the tissues used to construa the cDNA libraries fi*om which cDNA clones 
1^ encoding RECAP were isolated. 

9 Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 

yj 15 polypq>tides of the invmtion, along with applicable descriptions, references, and threshold parameta-s. 

h 

O DESCRIPTION OF THE INVENTION 

Before the present prolans, nucleotide sequences, and methods are descril>ed, it is understood 
that this invention is not limited to the particular machines, mataiaLs and methods described, as these 
20 may vary. It is also to be understood that the terminology used herein is for the piirpose of describing 
particular embodiments only, and is not intaided to limit the scope of the present invration which will be 
1j limited only by the appended claims. 

It must be noted that as used harein and in the appended claims, the singular forms "a," ''an," 
and "the" include plural referaice unless the context clearly dictates otherwise. Thus, for exan^le, a 
25 refarence to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otha'wise, all technical and scientific tains used heran have the same meanings 
as commonly unda'Stood by one of ordinary skill in the art to which this invention belongs. Although any 
machines, materials, and methods similar or equivalent to those described herein can be used to practice 
30 or test the present invention, the preferred machines, mata-ials and methods are now described. All 
publications mentioned heran are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are rqK>rted in the publications and which might be used in 
connection with the invaition. Nothing haan is to be construed as an admission that the invaition is not 
entitled to antedate such disclosure by virtue of prior invention. 
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DEFINITIONS 

"RECAP" refers to the amino acid sequences of substantially purified RECAP obtained from 
any species, particularly a manunalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
5 The term "agonist" refers to a molecule which intensifies cm* mimics the biological activity of 

RECAP. Agonists may include proteins, nucldc acids, carbohydrates, small molecules, or any other 
compound or con:q)osition which modulates the activity of RECAP eith^ by directly interacting with 
RECAP or by acting on componms of the biological pathway in which RECAP participates. 

An "allelic variant" is an altanative form of the gene encoding RECAP. Allelic variants may 
10 result from at least one mutation in the nucldc acid sequence and may result in altered mRNAs or in 
polyp^tides whose structure or function may or may not be altered. A gene may have none, one, or 
many alldic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are genially ascribed to natural deletions, additions, or substitutions of nucleotides. Each 
of these types of changes may occur alone, or in combination with the others, one or more times in a 
15 givCT sequence. 

"Altered" nucleic acid sequences encoding RECAP include those sequmces with deletions, 
insertions, or substitutions of diffa-mt nucleotides, resulting in a polypq}tide the same as RECAP or a 
polypeptide with at least one functional charactmstic of RECAP. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 

20 the polynucleotide encoding RECAP, and improp^ or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding RECAP. The 
encoded protein may also be "altered," and may contain deletions, insmions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent RECAP. Deliberate 
amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 

25 hydrophobicity, hydrophilicity, and/cn- the amphipathic nature of the residues, as long as the biological or 
immunological activity of RECAP is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 
include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 

30 having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, pq>tide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring protein 

35 molecule, "amino acid sequence" and like torms are not meant to limit the amino acid sequence to the 
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complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is genially carried out using polymerase chain reaction (PGR) technologies well known in 
the art. 

The term ""antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
RECAP. Antagonists may include proteins such as antibodies, nucleic acids» carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of RECAP either by 
directty interacting with RECAP or by acting on conqx)nents of the biological pathway in which RECAP 
participates. 

The term "antibody" refors to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab')2> fragments^ which are capable of binding an q}itopic det^minant. 

Antibodies that bind RECAP polypeptides can be prq)ared using intact polypeptides or using fragmrats 
containing small pq)tides of interest as the immunizing antigen. The polypq)tide or oligopeptide used to 
immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carri^ protein if desired. Commonly used carria's 
that are diemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet 
hemocyanin (KLH). The coupled pq>tide is then used to immunize the animal. 

The term "'antig^c determinant" refits to that region of a molecule (Le., an q)itope) that makes 
contact with a particular antibody. When a protein or a fragmrat of a protein is used to immunize a host 
animal, numerous regions of the protein may induce the production of antibodies which bind specifically 
to antigenic determinants (particular regions or three-dimensional structures on the prot^). An antigenic 
determinant may compete with the intact antigen (i.e., the immunogen used to elicit the immune response) 
for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" (coding) 
strand of a specific nucldc acid sequence. Antisense compositions may include DNA; RNA; peptide 
nucleic acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, 
methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2'- 
methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as S- 
methyl cytosine, 2'-deoxyuracil, or 7-deaza-2*-deoxyguanosine. Antisense molecules may be produced by 
any method including chemical synthesis or transcription. Once introduced into a cell, the 
complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence produced 
by the cell to form duplexes which block either transcription or translation. The designation "negative" 
or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can refer to the 
sense strand of a refo'ence DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
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functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic RECAP, or of any oligC9)q7tide thereof, 
to induce a specific immune response in appropriate animals or cdls and to bind with specific antibodies. 
"Complementary" describes the relationship between two single-stranded nucleic acid sequences 
5 that anneal by base-pairing. For example, 5-AGT-3* pairs with its complement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The conq^osition may comprise a dry formulation or an aqueous solution. 
10 Compositions comprising polynucleotide sequences encoding RECAP or fragments of RECAP may be 
^ployed as hybridization probes. The probes may be stored in fi-eeze-dried form and may be associated 
with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be dq>loyed in an 
1^ aqueous solution containing salts (e.g., NaCl), det^gents (e.g., sodium dodecyl sulfate; SDS), and ottier 

g components (e.g., Denhardt's solution, dry milk, salmon sp^m DNA, etc.). 

W 15 "Consensus sequence" refit's to a nucleic acid sequence which has been subjected to repeated 

DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Q Foster City CA) in the 5* and/or the 3* direction, and resequenced, or which has been assembled fi-om one 

^ or more overlapping cDNA, EST, (k genomic DNA fragments using a computer program for fii-agment 

assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI) or Phrap (UnivCTsity 
20 of Washington, SeattieWA). Some sequences have bera both extended and assembled to produce the 

ffi 

™ consensus sequence. 

fu "Conservative amino acid substitutions" are those substitutions that are predicted to least 

interfere with the propmies of the original protein, i.e., the structure and especially the function of the 
protdn Ls conserved and not significantiy changed by such substitutions. The table below shows amino 
25 acids which may be substituted far an original amino acid in a protdn and which are regarded as 
cons^vative amino acid substitutions. 

Original Residue ConsCTvative Substitution 

Ala Gly, Ser 

Arg His, Lys 

30 Asn Asp, Gin, His 

Asp Asn, Glu 

Cys Ala, Ser 

Gin Asn, Glu, His 

Glu Asp, Gin, His 

35 Gly Ala 

His Asn, Arg, Gin, Glu 

He Leu, Val 

Leu He, Val 



14 



SUBSTITUTE SHEET (RULE 26) 



wo 01/07612 



PCTAJSOO/20035 



Lys 
Met 
Phe 
Ser 
Thr 
Trp 
Tyr 
Val 



Arg, Gin, Glu 
Leu, lie 

His, Met, Leu, Trp, Tyr 



5 



Cys,Thr 
Ser, Val 
Phe, Tyr 



His, Phe, Trp 
He, Leu, Thr 



10 



Conservative amino acid substitutions gOTcrally maintain (a) the structure of the polypeptide 



backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge oc hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
15 absence of one or more amino acid residues or nucleotides. 

The t^m "derivative" ref^s to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological ftinction of the natural molecule. A d^ivative polypeptide is one 
20 modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a rep)orter molecule or enzyme that is capable of gen^ating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 



25 identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up to 
the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 

30 residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 

35 embodiments. 

A fragment of SEQ ID NO:23-44 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:23-44, lor example, as distinct from any other sequence in the 



A "fragment" is a unique portion of RECAP or the polynucleotide encoding RECAP which is 
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genome from which the fragment was obtained. A fragment of SEQ ID NO:23-44 is useful, for 
example, in hybridization and amplificaUon technologies and in analogous methods that disUnguish 
SEQ ID NO:23-44 from related polynucleotide sequences. The precise length of a fragment of SEQ ID 
NO:23-44 and the region of SEQ ID NO:23-44 to which the fragment corresponds are routinely 
5 determinable by one of ordinary skill in the art based on the intended purpose for the fragment 

A fragment of SEQ ID NO: 1-22 is encoded by a fragment of SEQ ID NO:23-44. A fragment 
* of SEQ ID NO: 1-22 comprises a region of unique amino acid sequence that specifically identifies SEQ 

ID NO: 1-22. For example, a fragment of SEQ ID NO: 1-22 is useful as an immunogenic peptide for the 
development of antibodies that specifically recognize SEQ ID NO: 1-22. The precise length of a 
10 fragment of SEQ ID NO: 1-22 and the region of SEQ ID NO: 1-22 to which the fragment corresponds 
are routinely determinable by one of ordinary skill in the art based on the intended purpose for the 
fragment. 

ilj, a "full-length" polynucleotide sequence is one containing at least a translation initiation codon 

2 (e.g., metiiionine) followed by an open reading frame and a translation tCTmination codon, A "full- 
U 15 lengtii" polynucleotide sequaice encodes a "ftill-length" polypq)tide sequence. 

"Homology*' ref^s to sequence similarity ot, intCTChangeably, sequence id^tity, between two or 
O more polynucleotide sequences or two or more polypqptide sequences. 

T The terms **percOTt identity'* and "% idcmtity," as applied to polynucleotide sequences, refer to 

P the po-centage of residue matches between at least two polynucleotide sequences aligned using a 

20 standardized algorithm. Such an algoritiim may insert, in a standardized and reproducible way, gaps in 
5 the sequences bdng compared in crder to optimize alignment between two sequences, and therefore 
ry achieve a more meaningful conq^arison of the two sequences. 

Po-cent identity between polynucleotide sequences may be detCTmined using the default 
parameters of the CLUSTAL V algorithm as incorporated into tiie MEGALIGN version 3.12e sequ^ce 
25 alignment program. This program is part of tiie LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For 
pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, 
gap penalty=5. window=4, and "diagonals saved"=4. The "wdghted" residue weight table is selected as 
30 the default Percent identity is rqx)rted by CLUSTAL V as the "percent similarity" between aligned 

polynucleotide sequmces. 

Alto-nativdy, a suite of commonly used and fredy available sequence con5)arison algorithms is 
provided by the National Center tor Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
35 sources, including the NCBI, Bethesda, MD, and on the Internet at 
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http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite Includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with othCT 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
5 Sequences" can be accessed and used intCTactively at http://www.ncbi.nlm.nih.gov/gQrl7bl2.html. The 
"BLAST 2 SequMces" tool can be used for both blastn and blastp (discussed below). BLAST programs 
are conmionly used with gap and other parameters set to default settings. For example, to compare two 
nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 2.0.12 (April-21- 
2000) set at default parameta-s. Such default paramet^s may be. for example: 
10 Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

M. Open Gap: 5 and Extension Gap: 2 penalties 

O 

p. Gap X drop-off: 50 

W 15 Expect: 10 

'^Q Word Size: 12 

5 Filter: on 

s PCTcent identity may be measured ovar the length of an entire defined sequmce, for example, as 

P defined by a particular SEQ ID number, or may be measured over a shortar length, for example, over the 

20 length of a fragment taken from a larger, defined sequ^ice, for instance, a fragmrat of at least 20, at least 

ffi 

g 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such lengths 

nj are exemplary only, and it is undorstood that any fragment length supported by the sequences shown 

herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which percentage 
identity may be measured. 
25 Nucleic acid sequences that do not show a high degree of identity may nevertheless racode 

similar amino acid sequences due to the degenwacy of the genetic code. It is undo-stood that changes in a 
nucleic acid sequence can be made using this degeno-acy to produce multiple nucleic acid sequences that 
all encode substantially the same protdn. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refor to the 
30 percentage of residue matches between at least two polypeptide sequences aligned using a standardized 
algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment methods take 
into account conservative amino acid substitutions. Such conservative substitutions, explained in more 
detail above, generally preserve the charge and hydrophobicity at the site of substitution, thus preserving 
the structure (and therefore function) of the polypeptide. 
35 Percent identity between polypeptide sequences may be determined using the default parameters 
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of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence alignment 
program (described and ref^enced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple^l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is sdected as the default residue w«ght table. As with 
5 polynucleotide alignments, the pCTcent identity is rqported by CLUSTAL V as the "percent similarity*' 
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
conqjarison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12 
(Apr-21-2000) with blas^ set at default parameters. Such default parameters may be, for example; 
10 Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 

Gap X drop-off: 50 

Expect: 10 

Word Size: 3 
15 Filter: on 

Percrat identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured ova- a shorter length, for 
example, over the length of a fragment takm from a larger, defined polypeptide sequence, for instance, a 
fragment of at least 15. at least 20. at least 30, at least 40. at least 50, at least 70 or at least 150 

20 contiguous residues. Such lengths are exemplary only, and it is undCTStood that any fragment length 
suppcMted by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length ovct which pCTCoitage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain DNA 
sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for chromosome 

25 replication, segregation and maintenance. 

The torn "humanized antibody" refers to an antibody molecule in which the amino acid sequence 
in the non-antigen binding regions has been altared so that the antibody more closely resembles a human 
antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 

30 complementary strand through base pairing undo* defined hybridization conditions. Specific 

hybridization is an indication that two nucleic acid sequences share a high degree of complOTientarity. 
Specific hybridization complexes form under pea-missive annealing conditions and remain hybridized after 
die '^washing" step(s). The washing step(s) is particularly important in determining the stringOTcy of tiie 
hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding 

35 between pairs of nucleic acid strands that are not parfectiy matched. Po-missive conditions for annealing 
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of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and may be 
consistrat among hybridization expariments, whereas wash conditions may be varied among experiments 
to achieve the desired stringency, and therefore hybridization specificity. Permissive annealing conditions 
occur, for example, at 68*^0 in the presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 

5 pg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the tempwature 
under which the wash st^ is carried out Such wash temperatures are typically selected to be about SX^ 
to 20°C lower than the therma] melting point (T^) for the specific sequence at a defined ionic strength and 
pH. The T„, is the temperature (under defined ionic strength and pH) at which 50% of the target 

10 sequence hybridizes to a p^fectly matched probe. An equation for calculating T^, and conditions for 
nucleic acid hybridization are well known and can be found in Sambrook, J. et aL, 1989, Molecular 
Cloning: A Laboratory Manual . 2"** ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapto* 9, 

High stringency conditions for hybridization between polynucleotides of the present invoition 

15 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
Alt^nativdy, temperatures of about 65 °C, dO'^C, 55 ^'C, or 42**C may be used. SSC concentration may 
be varied firom about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridizatioa Such blocking reagents include, for instance, 
sheared and denatured salmon sp^m DNA at about 100-200 ^ig/nH. Organic solvent, such as formamide 

20 at a concentration of about 35-50% v/v, may also be used under particular circumstances, such as for 
RNAiDNA hybridizations. Useful variations on these wash conditions will be readily apparent to those 
of ordinary skill in the art. Hybridization, particularly undar high stringency conditions, may be 
suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a 
similar role for the nucleotides and their encoded polypq^tides. 

25 The term "hybridization complex" refers to a complex formed between two nucldc acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g.. Cot or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, mmibranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells or 

30 their nucleic adds have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with infiammation, trauma, immune 
disord^s, or infectious or genetic disease, etc. These conditions can be characterized by expression of 

35 various factc^s, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular and 
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systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of RECAP which is 
capable of eliciting an immune response when introduced into a living organism, for example, a mammal. 
The term "immunog^c fragment" also includes any polypq3tide or oligopeptide fragment of RECAP 
5 which is useful in any of the antibody production methods disclosed herein or known in the art. 

The term "microarray" refo-s to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The tCTms "dement" and "array element" refa- to a polynucleotide, polypeptide, or othar 
chemical compound having a unique and deOned position on a microarray. 
10 The term "modulate" refers to a change in the activity of RECAP. For example, modulation may 

cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of RECAP. 

The phrases "nucldc acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refa: to DNA or RN A of gaiomic or 
15 synthetic origin which may be single-stranded or double-stranded and may r^resent the sense or the 
antisense strand, to pqptide nucldc acid (PNA), or to any DNA-like or RN A-like mat^ial. 

"Operably linked" refiM^s to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
20 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

*Tq)tide nucleic acid" (PNA) refars to an antisense molecule or anti-gene agent which comprises 
an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of amino add 
residues ending in lysine. The tominal lysine confers solubility to the composition, PNAs preferentially 
25 bind conq)lementary single suranded DNA or RNA and stop transcript elongation, and may be pegylated 
to extend their lifespan in the cell. 

"Post-translational modification" of an RECAP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and othca* modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
30 cell type depending on the enzymatic milieu of RECAP. 

"Probe" refers to nucleic acid sequences encoding RECAP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reports molecule. Typical labels 
include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primes" are short nucleic 
35 acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
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complementary base-pairing. The prim©- may then be extended along the target DNA strand by a DN A 
polymerase enzyme. Primer pairs can be used for amplitication (and identification) of a nucleic acid 
sequence, e.g.. by the polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
5 nucleotides of a known sequence. In ordo: to rahance specificity, longer probes and primers may also be 
employed, such as probes and primers that conqjrise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, or at 
least 150 consecutive nucleotides of the disclosed nuclac acid sequCTces. Probes and primers may be 
considerably longCT than these examples, and it is und^stood that any length suppcorted by the 
specification, including the tables, figures, and Sequence Listing, may be used. 
10 Methods for preparing and using probes and prime's are described in the references, for sample 

Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual . 2™* ed., vol. 1-3, Cold Spring 
Harbor Press, Plainview NY; Ausubel, P.M. et al.,1987. Current Proto cols in Molecular Biology, Greene 
Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al.. 1990, PCR Protocols, A Guide to 
Methods and Applications . Academic Press, San Diego C A. PCR prima- pairs can be derived from a 

wss? 

W 15 known sequence, for example, by using computer programs intended for that purpose such as Primer 
^ (Version 0.5 , 199 1 , Whitdiead Institute for Biomedical Research, Cambridge MA). 

5 Oligonucleotides for use as prime's are selected using software known in the art for such 

3?' purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

p 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 

20 nucleotides from an input polynucleotide sequence of up to 32 Idlobases. Similar prima* selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
prima- selection program (available to the public from ttie Genome Cento* at University of Texas South 
West Medicsd Cento*, Dallas TX) is capable of choosing specific primers from megabase sequences and 
is thus useful for designing primers on a goiome-wide sc(^. The Prima*3 primo selection program 
25 (available to the public from the Whitdiead Institute/MIT Cento for Genome Research, Cambridge MA) 
allows the user to input a "mispriming library," in which sequences to avoid as primo binding sites are 
user-specified. PrimorS is useful, in particular, for the selection of oligonucleotides for microarrays. 
(The source code for the latter two primer selection programs may also be obtained from their respective 
sources and modified to meet the user's specific needs.) The PrimeGen program (available to the pubUc 
30 from the UK Human Genome Mapping Project Resource Centre* Cambridge UK) designs primors based 
on multiple sequence alignments, thereby allowing selection of primers that hybridize to dther the most 
consoved or least coaserved regions of aligned nucldc acid sequences. Hence, ttiis program is useful for 
identification of both unique and conserved oligonucleotides and polynucleotide fragments. The 
oligonucleotides and polynucleotide fragments identified by any of the above selection methods are useful 
35 in hybridization technologies, for example, as PCR or sequencing primers, microarray elements, or 



m 
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specific probes to identify fiilly or partially complementary polynucleotides in a sample of nucleic acids. 
Methods of oligonucleotide selection are not limited to those described above. 

A "recombinant nucleic acid'* is a sequ^ce that is not naturally occurring or has a sequence that 
is made by an artificial combination of two or moare otherwise s^arated segments of sequence. This 
5 artificial combination is often acconqjlished by chemical synthesis or, more commonly, by the artificial 
manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques such as those 
described in Sambrook, supra . The tarm recombinant includes nucleic acids that have been alt^ed solely 
by addition, substitution, or deletion of a portion of the nucldc acid. Frequentiy, a recombinant nucleic 
acid may include a nucleic acid sequence operably linked to a promota- sequence. Such a recombinant 
10 nucldc acid may be part of a vector that is used, for example, to transfcHTn a cell. 

Altmiatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal vfh^dn the recombinant nucleic acid is 
expressed, inducing a protective immunological resi)onse in the mammal. 

Q A "regulatory element" refers to a nucleic acid sequence usually derived fi-om untranslated 

O 

hi regions of a gene and includes enhancers, promotes, introns, and 5* and 3* untranslated regions (UTRs). 

^ Regulatory elements interact with host or viral protdns which control transcription, translation, or RNA 

O stability. 

8 

^ "Rq)orter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

O amino acid, or antibody. Reports molecules include radionuclides; ^ymes; fluorescmt, 

1^ 20 ch^luminescmt, or chromogenic agents; substrates; cofactors; inhibit(»-s; magnetic particles; and other 

IS moieties known in the art. 

O 

An "RNA equivalent." in refCTence to a DNA sequence, is composed of the same linear sequence 
of nucleotides as the refo-ence DNA sequence with the exception that all occurrences of the nitrogenous 
base thymine are r^laced with uracil, and the sugar backbone is composed of ribose instead of 
25 deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic acids 
encoding RECAP, or fragments thereof, or RECAP itself, may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cdl; a cell; genomic DNA, RNA, or cDNA, in 
solution or bound to a substrate; a tissue; a tissue print; etc. 
30 The terms "specific binding" and "specifically binding" refer to that interaction between a protein 

or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or synthetic 
binding composition. The into-action is dependent upon the presence of a particular structure of the 
protein, e.g., the antig^ic determinant or epitope, recognized by the binding molecule. For example, if 
an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope A, or the 
35 presence of free unlabeled A. in a reaction containing free labeled A and the antibody will reduce the 
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amount of labeled A that binds to the antibody. 

The tCTm "substantially purified" refers to nucleic acid or amino acid sequences that are removed 
from their natural environment and are isolated or sq>arated, and are at least 60% free, preferably at least 
75% free, and most prefo-ably at least 90% free from other components with which they are naturally 
5 associated. 

A "substitution" refers to the replacem^t of one or more amino acid residues or nucleotides by 
diffluent amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, chips, 
slides, wafCTS, fibCTS, magnetic or nonmagnetic beads, gds, tubing, plates, polymo^s, microparticles and 
10 capillaries. The substrate can have a variety of surface forms, such as wells, trenches, pins, channels and 
pores, to which polynucleotides or polypq>ti(tes are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 

'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
15 cell. Transformation may occur undCT natural or artificial conditions according to various methods well 
known in the art, and may rdy on any known method for the insertion of fordgn nucleic acid sequences 
into a prokaryoUc or eukaryotic host c^l. The method for transformation is seleaed based on the type of 
host cell being transformed and may include, but is not limited to, bacto-iophage or viral infection, 
dectrop(»-ation, heat shock, lipofection, and particle bombardm^t. The term "transformed" cells 
20 includes stably transformed cells in which the insmed DNA is capable of r^lication either as an 

autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA cm* RN A for limited p^iods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 
and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
25 introduced by way of human intervention, such as by transgenic techniques well known in the art. The 
nucleic add is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro 
fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The transgenic 
30 organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, 
frmgi, plants, and animals. The isolated DNA of the present invention can be introduced into the host 
by methods known in the art, for example infection, transfection, transformation or transconjugation. 
Techniques for transferring the DNA of the present invention into such organisms are widely known 
and provided in references such as Sambrook et al. (1989), supra . 
35 A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
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least 40% sequence identity to the particular nucleic acid sequence over a cmain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Vision 2.0.9 (May-07-1999) 
set at default paramet^s. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%» at least 95% or at least 98% or gjreaier 
5 sequCTce identity ova- a catain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
idMtity to a refo-ence molecule, but will generiMy have a greater ot lesser number of polynucleotides due 
to alternative splicing of exons during mRN A processing. The corresponding polypqjtide may possess 
additional functional domains or lack domains that are present in the refermce molecule. Species 
10 variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides 
generally will have significant amino acid identity relative to each other. A polymorphic variant is a 
variation in the polynucleotide sequrace of a particular gene betwem individuals of a given species. 
H Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the 
g polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of, for 
yj 15 exan^le, a certadrx population, a disease state, or a prq^ensity for a disease state. 
2 A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

least 40% sequrace identity to die particular polypqitide sequence over a certain length of one of the 
polypq>tide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
20 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greats- sequence identity 
over a certain defined length of one of the polypeptides, 
ry THE INVENTION 

The invention is based on the discovay of new human recq)tors and associated protdns 
(RECAP), the polynucleotides CTCoding RECAP, and the use of these compositions for the diagnosis, 
25 treatment, or prevention of neurological disord^-s; immunological disorders, including 
autoimmune/infiammatory disord^s; and cell proliferative disordo-s, including cancer. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
RECAP. Columns 1 and 2 show the sequence idmtification numbCTS (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 
30 nucleic acids encoding each RECAP w^e identified, and column 4 shows the cDNA libraries fi^om which 
these clones wctc isolated. Column 5 shows Incyte clones and their corresponding cDNA libraries. 
Clones for which cDNA libraries are not indicated w^e derived from pooled cDNA libraries. In some 
cases, GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank cDNA 
sequences, where indicated, in colunm 5 wo-e used to assemble the consrasus nucleotide sequence of each 
35 RECAP and are useful as fragments in hybridization technologies. 
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The columns of Table 2 show various properties of each of the polyp^tides of the invention: 
column 1 refCTences the SEQ ID NO; column 2 shows the numb^ of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 

5 shows homologous sequences as identified by BLAST analysis along with relevant citations, all of which 
are expressly incorpcM-ated by refiorence herein in their entirety; and column 7 shows analytical methods 
and in some cases, searchable databases to which the analytical methods wa-e applied. The methods of 
column 7 wore used to characterize each polypeptide through sequence homology and protein motifs. 
The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 

10 associated with nucleotide sequences ^coding RECAP. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1 . These fragments are 
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:23-44 and to 
distinguish between SEQ ID NO:23^ and related polynucleotide sequences. The polypeptides 
encoded by these fragments are useful, for exaiiq>le, as immunogenic pq>tides. Column 3 lists tissue 

15 categOTies which express RECAP as a fraction of total tissues expressing RECAP. Column 4 lists 
diseases, disorders, or conditions associated with those tissues expressing RECAP as a fraction of total 
tissues expressing RECAP. Column 5 lists the vectors used to subclone each cDNA library. Of 
particular interest is the expression of SEQ ID NO:l 1 in hematopoi^c/immune tissues and the 
expression of SEQ ID NO: 14 in reproductive tissues. 

20 The columns of Table 4 show descriptions of the tissues used to construa the cDNA libraries 

from which cDNA clones encoding RECAP wctc isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDN A libraries from which these clones were isolated, and column 3 shows the 
tissue origins and other descriptive information r^evant to the cDNA libraries in colunm 2. 

The invention also ^compasses RECAP variants. A preferred RECAP variant is one which has 

25 at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the RECAP amino acid sequence, and which contains at least one functional or structural 
characteristic of RECAP. 

The invention also encompasses polynucleotides which encode RECAP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence seleaed from 

30 the group coasisting of SEQ ID NO:23-44, which encodes RECAP. The polynucleotide sequences of 
SEQ ID NO:23-44, as presOTted in the Sequrace Listing, embrace the equival^t RN A sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding RECAP. In 

35 particular, such a variant polynucleotide sequence will have at least about 70%, or alto-nativdy at least 
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about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding RECAP. A particular aspea of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:23-44 which has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide sequence 
5 identitytoanucleicacidsequencesdectedfromthegroupconsistingof SEQ IDNO:23-44. Any one of 
the polynucleotide variants described above can encode an amino acid sequence which contains at least 
one functional or structural charactmstic of RECAP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding RECAP, some bearing minimal similarity to the 
10 polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contenqplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in accordance 
M- with the standard triplet gen^c code as applied to the polynucleotide sequence of naturally occurring 
p RECAP, and all such variations are to be considered as being spedtically disclosed 
yJ 15 Although nucleotide sequences which encode RECAP and its variants are gena^ally capable of 

!q hybridizing to the nucleotide sequence of the naturally occurring RECAP under appropriately sdected 
^ conditions of stringency, it may be advantageous to produce nucleotide sequences encoding RECAP or its 
T derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 

codons. Codons may be selected to increase the rate at which expression of the peptic^ occurs in a 
H 20 particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons 
2 are utilized by the host Otha: reasons for substantially altering the nucleotide sequence encoding 

fU RECAP and its derivatives without altering the «icoded amino acid sequences include the production of 

RN A transcripts having more desirable propoties, such as a greater half-life, than transcripts produced 
from the naturally occurring sequence. 
25 The invention also racompasses production of DNA sequences which encode RECAP and 

RECAP derivatives, or fragments thereof, entirdy by synthetic chemistry. Aftcar production, the synthetic 
sequence may be insated into any of the many available expression vectors and cell systems using 
reagents v/ekl known in the art. Moreovo", synthetic chanistry may be used to introduce mutations into a 
sequence mcoding RECAP or any fragment thereof. 
30 Also encompassed by the invention are polynucleotide sequences that are capable of hybridizing 

to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID NO:23-44 and 
fragments th^eof undo: various conditions of stringency. (See, e.g., WaW, G.M. and S.L. Bergo" (1987) 
Methods Enzymol. 152:399-407; Kimmd, A.R. (1987) Methods Enzymol. 152:507-511.) Hybridization 
conditions, including annealing and wash conditions, are described in "Detinitions." 
35 Methods for DNA sequencing are well known in the art and may be used to practice any of the 
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embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of DN A 
polyma^ase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymearase (PE Biosystems, Fostar 
City CA), thermostable T7 polym^ase (Amersham Pharmacia Biotech, Piscataway NJ), or combinations 
of polymo'ases and proofreading exonucleases such as those found in the ELONG ASE anq}lification 
5 system (Life Technologies, Gaitharsburg MD). Preferably, sequmce preparation is automated with 
machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV). PTC200 thea-mal 
cycler (MJ Research, Watotown MA) and ABI CATALYST 800 thermal cycler (PE Biosystems). 
Sequencing is then carried out using dth^ the ABI 373 or 377 DNA sequencing system (PE 
Biosystems), the MEG ABACE 1000 DNA sequmcing system (Molecular Dynamics, Sunnyvale CA), or 
10 other systems known in the art. The resulting sequences are analyzed using a variety of algorithms which 
are well known in the art. (See, e.g., Ausubd, P.M. (1997) Short Protocols in Molecular Biolosv . John 
Wiley & Sons, New York NY, unit 7.7; Meyers, R. A. (1995) Molecular Biologv and Biotechnology , 
Wiley VCH, New York NY, pp. 856-853.) 

The nucleic add sequmces encoding RECAP may be extended utilizing a partial nucleotide 
W 15 sequence and employing various PCR-based methods known in the art to detect upstream sequences, such 
Iq as promoters and regulatory el^nents. For example, one method which may be employed, restriction-site 

iJ PCR, uses univ^sal and nested primars to amplily unknown sequence from genomic DNA within a 

3 cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) Another method, 

2 inverse PCR, uses primars that extend in div^gent directions to amplify unknown sequence from a 

H 20 circularized ten^late. The tmiplate is doived from restriction fragments comprising a known gnomic 
M locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) A 

Oj third method, capture PCR, involves PCR aiiq)lification of DNA fragments adjacent to known sequences 

in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods 
Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and ligations may be used to 
25 insert an ^gineored double-stranded sequence into a region of unknown sequence before p^forming 
PCR. Othar methods vMch may be used to retrieve unknown sequmces are known in the art. (See, e.g., 
ParkCT, J.D. et al. (1991) Nucldc Acids Res. 19:3055-3060). Additionally, one may use PCR, nested 
primers, and PROMOTERHNDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This 
procedure avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR* 
30 based methods, primes may be designed using comm^dally available software, such as OLIGO 4.06 
Primer Analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be 
about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 
template at tempca-atures of about 68**C to 72°C. 

When screening for full-length cDNAs, it is prefea-able to use libraries that have been 
35 size-selected to include larg^ cDN As. In addition, random-primed libraries, which often include 
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sequences containing the 5* regions of genes, are preferable for situations in which an oligo cl(T) library 
does not yield a full-length cDNA. Genomic libraries may be useHil for extension of sequence into 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis syst^ns which are commercially available may be used to analyze the 
S size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polym^s for electrophoretic separation, four different nucleotide- 
specific, las^-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
knitted waveloigths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g,, GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire process 

10 from loading of samples to compute analysis and electronic data display may be computer controlled. 
Capillary electrophoresis is especially preferable for sequencing small DNA firagmmts which may be 
present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode RECAP may be cloned in recombinant DNA molecules that direct expression of RECAP, or 

15 fragments or functional equivalents thereof, in appropriate host cells. Due to the inho-mt degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express RECAP. 

The nucleotide sequences of the preset inv^tion can be ^gineered using methods generally 
known in the art in order to alto* REC AP-encoding sequences for a variety of purposes including, but not 

20 limited to, modification of the cloning, processing, and/or expression of the gene product. DNA shuffling 
by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be 
used to ragineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed 
mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation 
pattens, diange codon prefi^-ence, produce splice variants, and so forth. 

25 The nucleotides of the present invention may be subjected to DNA shuffling techniques such as 

MOLECULARBREEDING (Maxygen Inc., Santa Qara CA; described in U.S. Patent Numb^ 
5.837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
BiotechnoL 17:259-264; andCrameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve 
the biological properties of RECAP, such as its biological or enzymatic activity or its ability to bind to 

30 other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic div^sity is created through "artiflcial" breeding and rapid molecular 

35 evolution. For example, fragments of a single gene containing random point mutations may be 
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recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 
5 In another embodiment, sequences encoding RECAP may be synthesized, in whole or in part, 

using chemical methods wdl known in the art. (See, e.g., Caruthers, M,H. et al. (1980) Nucleic Acids 
Symp. Set. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Altemativdy, 
RECAP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., Crelghton, 

10 T. (1984) Proteins. Structures and Molecular Properties . WH Freeman, New York NY, pp. 55-60; and 
RobCTge, J. Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the ABI 
431 A peptide synthesize (PE Biosystems). Additionally, the amino acid sequence of RECAP, or any 
part thereof, may be alteed during direct synthesis and/or combined with sequ^ces from otim proteins, 
CM- any part thareof, to produce a variant polypq)tide or a polypqptide having a sequence of a naturally 

15 occurring polypeptide. 

The peptide may be substantially purified by preparative high p^formance liquid 
chromatography. (See, e.g., Chiez, R.M. andF.Z. Regni€ar (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic pq)tides may be conjfirmed by amino acid analysis or by sequencing. 
(See, e.g., Crdghton, supra , pp. 28-53.) 

20 In order to express a biologically active RECAP, the nucleotide sequences ^coding RECAP or 

dmvatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains the 
necessary el^nents for transcriptional and translational control of the inserted coding sequence in a 
suitable host These d^ents include regulatory sequences, such as enhancers, constitutive and inducible 
promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences encoding 

25 RECAP. Such el^ents may vary in thdr str^gth and specificity. Specific initiation signals may also 
t>e used to achieve more efficient translation of sequences encoding RECAP. Such signals include the 
ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where sequences 
encoding RECAP and its initiation codon and upstream regulatory sequences are iaserted into the 
appropriate expression vector, no additional traascriptional or translational control signals may be 

30 needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including an in-frame ATG initiation codon should be provided by the vector. 
Exogenous translational elements and initiation codons may be of various origins, both natural and 
synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for 
the particular host cdl system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 

35 20:125-162.) 
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Methods which are well known to those skilled in the art may be used to construct expression 
vectCMTS containing sequences encoding RECAP and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DN A techniques, synthetic techniques, and in vivo 
genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning^ A Laboratorv Manual s 
5 Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, P.M. et al. (1995) Current 
Protocols in Molecular Bioloev , John Wiley & Sons, New York NY. ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding RECAP. These include, but are not limited to, microorganisms such as baama transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
10 yeast expression vectors; insect cdH systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus. TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra : Van Heeke. G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
15 Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. 
Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu. N. (1987) 
EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 
p 224:838-843; Winter, J. et al, (1991) Results Probl. Cell Differ. 17:85-105; TTie McGraw Hill 

Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and 
J 20 T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. 
O Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or heq3es or vaccinia 

viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) Canco: Gen. Ther. 
5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. 
25 (1985) Nature 317(6040):813-815; McGregor, D.P. et al, (1994) Mol. Immunol. 31(3):219-226; and 
Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell 
employed. 

In bacterial systems, a numt>^ of cloning and expression vectors may be selected depending upon 
the use intended for polynucleotide sequences encoding RECAP. For example, routine cloning, 
30 subcloning, and propagation of polynucleotide sequences encoding RECAP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORTl plasmid 
(Life Technologies). Ligation of sequences encoding RECAP into the vector' s.multiple cloning site 
disrupts the ladZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these veaors may be useful for in vitro 
35 transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested deletions 
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in the cloned sequence. (See, e.g.. Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of RECAP are needed, e.g» for the production of antibodies, 
vectors which direct high level expression of RECAP may be used. For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promote may be used. 
5 Yeast expression systems may be used for production of RECAP. A number of vectcM-s 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia oastoris . In addition, such 
vectors direa eith^ the secretion or intracellular retention of expressed protdns and enable integration of 
foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra : Bitter, 

10 supra : and Scocet, supra .) 

Plant systems may also be used for expression of RECAP. Transcription of sequences encoding 
RECAP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamaisu, N. (1987) EMBO J. 6:307-31 1), 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 

15 used. (See, e.g., Coruzzi, supra : Broglie, supra : and Winter, supra .) These constructs can be introduced 
into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, e.g.. The 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New YacK NY, pp. 191-196.) 

In mammalian cells, a numb^ of viral-based expression systems may be utilized. In cases whi^e 
an adenovirus is used as an expression vector, sequmces encoding RECAP may be ligated into an 

20 adenovirus transcription/translation complex consisting of the late promote and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses RECAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhance, may be used to increase expression in mammalian host cells. SV40 or EB V-based 

25 vectors may also be used fcr high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larg^ fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 1 0 Mb are 
constructed and delivered via conventional delivery methods Oiposomes, polycationic amino polymers, or 
vesicles) for therapeutic purposes. (See, e.g„ Harrington, J.J. et al. (1997) Nat. Genet, 15:345-355.) 

30 For long term production of recombinant proteins in mammalian systems, stable expression of 

RECAP in cell lines is preferred. For example, sequences encoding RECAP can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 

35 b&ng switched to selective media. The purpose of the selectable marker is to confer resistance to a 
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selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include^ 

5 but are not limited to, the hopes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tk' and apr cells, respectively. (See, e.g., Wigler. M. et al. (1977) Cell 1 1 :223-232; 
Lowy, I. et al. (1980) Cdl 22:817-823.) Also, antim^abolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confo-s resistance to methotrexate; neo confCTS 
resistance to the aminoglycosides neomycin and G-418; and als and pat confo- resistance to chlorsulfliron 

10 and phosphinotricin ac^yltransfoase, respectively. (See. e.g., Wigjer, M. ^ al. (1980) Proc. Natl. Acad. 
Sci. USA 77:3567-3570; Colbo-e-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) Additional 
selectable genes have bean described. e.g., trpB and hisD, which alter cellular requirements for 
metabolites. (See, e.g., Hartman. S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent protdns (GFP; Clontech), B 

15 glucuronidase and its substrate 6-glucuronide, or luciferase and its substrate luciferin may be used. 

These markers can be used not only to identify transformants, but also to quantify the amount of transient 
or stable protein expression attributable to a specific vector system. (See, e.g.. Rhodes. C. A. (1995) 
Methods Mol. Biol. 55:121-131.) 

Although the presOTce/absence of mark^ grae expression suggests that the gene of interest is 

20 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding RECAP is inserted within a marker gene sequence, transformed cells containing 
sequcaices encoding RECAP can be identified by the absence of marker gene function. Altmatively, a 
marker gene can be placed in tandem with a sequence encoding RECAP under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 

25 expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding RECAP and that express 
RECAP may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein 
bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for 

30 the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of RECAP using eitho* 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated 
cell sorting (FACS), A two-site, monoclonal-based inununoassay utilizing monoclonal antibodies 

35 reactive to two non-int^fering epitopes on RECAP is preferred, but a competitive binding assay may be 
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employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et al. (1990) 
Serological Methods, a Laboratory Manual , APS Press, St. Paul MN, Sect. IV; Coligan, J.E. et al. 
(1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-IntCTscience, New York 
NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ.) 
5 A wide yariety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PGR probes lor detecting sequences related to polynucleotides encoding RECAP include oligolabeling, 
nick translation, end-labeling, or PCR amplification using a labded nucleotide. Altea-nativdy, the 
sequCTces encoding RECAP, or any fragments thereof, may be cloned into a vector for the production of 
10 an mRNA probe. Such vectors are known in the art. are commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Ama-sham Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reportCT molecules or labels which may be used for ease of detection include 
15 radionuclides, aizymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences ^coding RECAP may be cultured undo* 
conditions suitable for the expression and recovery of the protein from cell culture. The proton produced 
by a transformed cell may be secreted or retained inu*acdlularly depending on the sequence and/or the 
20 vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which ^code RECAP may be designed to contain signal sequences which direct 
secretion of RECAP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the insmed 
sequences or to process the expressed protdn in the desired fashioa Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prqjro" ac ''pro** form of the 
protein may also be used to specify protein targeting, folding, and/or activity. DiftCTent host cells which 
have specific cellular machinoy and characteristic mechanisms for post-translational activities (e.g., 
CHO, HeLa, MDCK. HEK293, and WI38) are available from the American Type Culture Collection 
(ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the 
fordgn protein. 

In another embodiment of the invention, natural, modified, or recombinant nucldc acid sequences 
encoding RECAP may be ligated to a heterologous sequence resulting in translation of a fusion protein in 
any of the afOTonentioned host systems. For example, a chimca-ic RECAP protein containing a 
heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
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screening of peptide libraries for inhibitors of RECAP activity. HetCTOlogous protein and peptide 
moieties may also facilitate purification of fusion proteins using conmiCTcially available affinity matrices. 
Such moieties include^ but are not limited to, glutathione S-transferase (GST), maltose binding protdn 
(MBP). thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin 

5 (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate ftision protdns on 

immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. 
FLAG, c-myc, and hemagglutinin (HA) enable inununoaffinity purification of fusion protdns using 
commercially available monoclonal and polyclonal antibodies that specifically recognize these q)itope 
tags. A fusion protdn may also be engineered to contain a proteolytic cleavage site located between the 

10 RECAP encoding sequence and the heterologous protdn sequence, so that RECAP may be cleaved away 
from the het^ologous moiety following purification. Methods for fusion protdn ^pression and 
purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commerdally available kits 
may also be used to fadlitate expression and purification of fusion proteins. 

In a furtl»- embodiment of the invention, synthesis of radiolabeled RECAP may be achieved in 

15 vitro using the TNT rabbit reticulocyte lysate or wheat gg-m extract system (Promega). These systems 
couple transcription and translation of protein-coding sequmces opwably associated with the T7, T3, or 
SP6 promotCTs. Translation takes place in the presence of a radiolabded amino add precursor, for 
example, ^^S-methionine. 

RECAP of the present invention or fragments thereof may be used to screen for compounds 

20 that spedfically bind to RECAP. At least one and up to a plurality of test compounds may be screened 
for specific binding to RECAP. Examples of test compounds include antibodies, oligonucleotides, 
proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
RECAP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 

25 natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which RECAP 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In eitha* case, the 
compound can l)e rationally designed using known techniques. In one embodiment, screening for these 
compounds involves producing appropriate cells which express RECAP, either as a secreted protein or 

30 on the cell membrane. Preferred cells include cells fi-om mammals, yeast, E)rosoDhila , or E. coli . Cells 
expressing RECAP or cell membrane fractions which contain RECAP are then contacted with a test 
compound and binding, stimulation, or inhibition of activity of either RECAP or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
35 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, the 
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assay may comprise the steps of combining at least one test compound with RECAP, either in solution 
or affixed to a solid support, and detecting the binding of RECAP to the compound. Alternatively, the 
assay may detect or measure binding of a test compound in the presence of a labeled competitor. 
Additionally, the assay may be carried out using cell-free preparations, chemical libraries, or natural 
5 product mixtures, and the test compound(s) may be free in solution or affixed to a solid support. 

RECAP of the present invention or fragments thereof may be used to screen for compounds 
that modulate the activity of RECAP. Such compounds may include agonists, antagonists, or partial or 
inv^e agonists. In one embodiment, an assay is performed under conditions pCTmissive for RECAP 
activity, whwein RECAP is combined with at least one test compound, and the activity of RECAP in 
10 the presence of a test compound is compared with the activity of RECAP in the absence of the test 
compound. A change in the activity of RECAP in the presence of the test compound is indicative of a 
compound that modulates the activity of RECAP. Alternatively, a test compound is combined with an 
in vitro or cell-free system comprising RECAP under conditions suitable for RECAP activity, and the 

O assay is performed. In either of these assays, a test compound which modulates the activity of RECAP 

Id 

£2 15 may do so indirectly and need not come in direct contact with the test compound. At least one and up 
%Q to a plurality of test compounds may be screened. 

j« In another embodiment, polynucleotides encoding RECAP or their mammalian homologs may 

« be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 

O 

cells. Such techniques are well known in the art and are useful fox the generation of animal models of 
20 human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For example, 

03 

p mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and 

grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted 
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 
244: 1288-1292). The vector integrates into the corresponding region of the host genome by 
25 homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 
(1996) Clin. Invest. 97:1999-2002; Wagner, K.U, et al. (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the 
30 resulting chimeric progeny are gcnotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding RECAP may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
35 into, for example, neural cells, hematopoietic lineages, and cardiomyocyies (Thomson, J.A. et al. 
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(1998) Science 282:1145-1 147). 

Polynucleotides encoding RECAP can also be used to create "knoclcin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding RECAP is injected into animal ES cells, and the injected sequence 
5 integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, a 
mammal inbred to overexpress RECAP, e.g., by secreting RECAP in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1 998) Biotechnol. Annu. Rev. 4:55-74). 
10 THERAPEUTICS 

Chemicsd and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of RECAP and receptors and associated proteins. In addition, the expression of RECAP is 
closely associated with cell proliferation, cancer, inflammation/trauma, and with neurological disorders. 
Therefore, RECAP appears to play a role in neurological disorders; immunological disorders, including 

15 autoimmune/inflammatory disord^s; and cell prolif^ative disorders, including cancer. In the treatment 
of disord^ associated with increased RECAP expression or activity, it is desirable to decrease the 
expression or activity of RECAP. In the treatment of disord^ associated with decreased RECAP 
expression or activity, it is desirable to increase the expression or activity of RECAP. 

Therefore, in one embodiment, RECAP or a fragment or derivative th^eof may be 

20 administered to a subject to treat or prevent a disord^ associated with decreased expression or activity 
of RECAP. Examples of such disorda*s include, but are not limited to, a neurological disorder such as 
epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease. Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
Down's syndrome, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 

25 muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative 
intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion 
diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome; 
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 

30 tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system, cerebral palsy, 
neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermalomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies; myasthenia 

35 gravis, periodic paralysis; mental disorders including mood, anxiety, and sciiizophrenic disorders; 
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seasonal affective disorder (SAD); akathesta, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corcicobasal degeneration, and familial frontotemporal dementia; an immunological 
disorder, including autoimmune/inflammatory disorders, such as acquired immunodeficiency syndrome 
5 (AIDS), X-linked agammaglobinemia of Bruton, common variable immunodeficiency (CVI), 

DiGeorge's syndrome (thymic hypoplasia), thymic dysplasia, isolated IgA deficiency, severe combined 
inununodeficiency disease (SCID), immunodeficiency with thrombocytopenia and eczema (Wiskott- 
Aldrich syndrome), Chediak-Higashi syndrome, chronic granulomatous diseases, hereditary 
angioneurotic edema, and immunodeficiency associated with Cushing's disease, Addison's disease, 

10 adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis. 

15 glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
p^cardial inflammation, osteoartiuitis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus ^ythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 

20 syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic inlections, hematopoietic cancers, including lymphoma, 
leukemia, and myeloma, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), 
myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 

25 thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, 
brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, 
ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
uterus. 

30 In anotho" embodiment, a vector capable of expressing RECAP or a fragment or derivative 

thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of RECAP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
RECAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 

35 prevent a disorder associated with decreased expression or activity of RECAP including, but not limited 
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to, those provided above. 

In still anoth^ embodiment, an agonist which modulates the activity of RECAP may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or activity 

of RECAP including, but not limited to, those listed above. 
5 In a furthCT embodiment, an antagonist of RECAP may be administ€3*ed to a subject to treat or 

prevent a disorder associated widi increased expression or activity of RECAP. Examples of such 

disorders include, but are not limited to, those neurological disord^s; immunological disorders, Including 

autoimmune/inflanunatory disorders; and cell prolif^ative disordo's, including cancer, described above. 

In one aspect, an antibody which specifically binds RECAP may be used directly as an antagonist or 
10 indirectly as a targeting or delivoy mechanism for bringing a pharmaceutical agent to cells or tissues 

which express RECAP. 

In an additional anbodim^t, a vector expressing the complement of the polynucleotide encoding 
RECAP may be administared to a subject to treat or prevent a disord^ associated with increased 
expression ot activity of RECAP including, but not limited to, those desaibed above. 

15 In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 

sequences, or vectors of the invention may be administared in combination with otho* appropriate 
th^-apeutic agents. Selection of the appropriate agents fOT use in combination th^apy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination of 
th«-apeutic agents may act synergistically to effect the treatment or prevention of the various disorders 

20 described above. Using this approach, one may be able to achieve tha-ap^tic efficacy with Iowa- 
dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of RECAP may be produced using methods which are genially known in the art. 
In particular, purified RECAP may be used to produce antibodies or to screm libraries of pharmaceutical 
agents to identify those which speciilcally bind RECAP. Antibodies to RECAP may also be generated 

25 using methods that are wdl known in the art. Such antibodies may include, but are not limited to, 

polyclonal, monoclonal, chim^c, and single chain antibodies. Fab fragments, and fragments produced by 
a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer fcwmation) are genially 
preferred for th^apeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 

30 others may be inununized by injection with RECAP or with any fragment or oligopeptide th^eof which 
has immunogenic propmies. Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include, but are not limited to, Freund's, minea-al gels such as 
aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 

35 Calmette-Guerin) and Corvnebacterium parvum are especially prefo'able. 
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It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to RECAP 
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, pq>tides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein. ShCMt stretches of RECAP amino 
5 acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 
may be produced. 

Monoclonal antibodies to RECAP may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. 
10 (See, e.g.. KoWer, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 
81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and Cole, S.P. et al. (1984) 
Mol. Cell Biol. 62:109-120.) 
|4. In addition, techniques developed for the production of "chimeric antibodies," such as the 

^ splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen 

yJ 15 specificity and biological activity, can be used. (See. e.g., Morrison, S.L. et al. (1984) Proc. Nati. Acad. 
5 Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, S. et al. 

5 (1985) Nature 314:452-454.) Alternativdy, tecdiniques described tor the production of single chain 

B antibodies may be adapted, using methods known in the art, to produce RECAP-specific single chain 

P antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be geno^ated by 

M" 20 chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g.. Burton, D.R. (1991) 
p Proc. Nati. Acad. Sci. USA 88:10134-10137.) 

rj Antibodies may also be produced by inducing in vivo production in the lymphocyte population or 

by screening immunogJobuUn libraries or panels of highly specific binding reagents as disclosed in the 
Dterature. (See, e.g., Orlandi, R. et al. (1989) Proc. Nati. Acad. Sci. USA 86:3833-3837; Winter, G. et 
25 al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for RECAP may also be generated. For 
example, such fragments include, but are not Umited to, F(ab*)2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab*)2 
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
30 identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identic antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
35 immunoassays typically involve die measurement of complex formation between RECAP and its specific 
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antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering RECAP c?)itopes is generally used, but a compeUtive binding assay may also be employed 
(Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
5 may be used to assess the affinity of antibodies for RECAP. Affinity is expressed as an association 
constant, K^, which is defined as the molar concentration of REC AP-antibody complex divided by the 
molar concentrations of free antigen and free antibody undear equilibrium conditions. The determined 
fcM- a preparation of polyclonal antibodies, wtuch are heterog^eous in thdr affinities for multiple 
RECAP epitopes, rq)resents the average affinity, or avidity, of the antibodies for RECAP. The 
10 determined for a preparation of monoclonal antibodies, which are monospecific for a particular RECAP 
epitope, represents a true measure of affinity, ffigh-affinity antibody preparations with ranging from 
about 10^ to 10^^ L/mole are prefOTed for use in inmiunoassays in which the REC AP-antibody complex 
^ must witiistand rigorous manipulations. Low-affinity antibody preparations with ranging from about 

10^ to 10' L/mole are preferred for use in immunopurification and similar procedures which ultimately 
15 require dissociation of RECAP, prefo-ably in active form, from ttie antibody (Catty, D. (1988) 
£ Antibodies. Volume I: A Practical Approach . IRL Press, Washington DC; Liddell, J.E. and A. Cryer 

(1991) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New Ywk NY). 

The titer and avidity of polyclonal antibody pr^arations may be ftirtha* evaluated to detarmine 
the quaUty and suitability of such preparations for catain downstream applications. For example, a 
20 polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of REC AP-antibody 
complexes. Procedures for evaluating antibody specificity, titar. and avidity, and guidelines for antibody 
quality and usage in various applications, are genially available. (See, e.g.. Catty, sjipra, and Coligan et 
al., supra .) 

25 In another embodiment of the invention, the polynucleotides encoding RECAP, or any fragment 

or complemait thereof, may be used for therapeutic puqK)ses. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PN A, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding RECAP. 
Such technology is wdl known in the art, and antisense oUgonucleotides or largo* fragments can be 
30 designed from various locations along the coding or control regions of sequences encoding RECAP. 
(See. e.g., Agrawal, S., ed. (1996) Antisense Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
35 complementary to at least a portion of the cellular sequence encoding die target protein. (See, e.g., 
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Slater, J.E. et al. (1998) J, Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(1 3): 1 288- 1 296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g.. Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Then 63(3):323-347,) Other 
5 gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art, (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, RJ. et al. 
(1998) J. Pharm. Sci. 87(1 1):1308"1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding RECAP may be used tor 
10 somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of seva*e combined inmiunodeficiency (SCID)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inhmted admosine deaminase (ADA) deficiency (Blaese, 
p R.M. et al. (1995) Science 270:475-480; Bordignon, C. ^ al. (1995) Science 270:470-475), cysUc 

15 fibrosis (Zabner, J. et al. (1993) Cdl 75:207-216; Crystal, R.G. et al. (1995) Hum. G^e Th^apy 6:643- 
^rl 666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 

hypercholesterol^a, and hemophilia resulting fi-om Factor VIII or Factor IX deficiencies (Crystal, R.G. 
1, (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (li) express a 

conditionally lethal gene product (e.g., in the case of canc^s which result from unregulated cell 
20 prolita-ation), or (iii) express a protdn which affords protection against intracdlular parasites (e.g., 
against human retroviruses, such as human inmiunodeficiency vims (HIV) (Baltimore, D. (1988) Nature 
335:395-396; PoescWa, E. ^ al. (1996) Proc. Natl. Acad. Sci. USA. 93:11395-11399), hq)atitis B or C 
virus (HB V, HC V); fungal parasites, such as Candida albicans and Paracoccidioides brasiliensis : and 
protozoan parasites such as Plasmodium falciparum and Trvpanosoma cruzi) . In the case where a 
25 genetic deficiency in RECAP expression or regulation causes disease, the expression of RECAP fi-om an 
appropriate population of transduced cdls may alleviate the clinical manifestations caused by the genetic 
deficiency. 

In a furthCT embodiment of the invention, diseases or disorders caused by defici^cies in RECAP 
are treated by constructing mammalian expression vectors encoding RECAP and introducing these 
30 vectors by mechanical means into RECAP-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 
particle deliv^, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217; 
Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. R6cipon (1998) Curr. Opin. Biotechnol. 9:445- 
35 450). 
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Expression vectors that may be effective for the expression of RECAP include, but are not 
Umited to, the PGDN A 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG. PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF. PTET-ON, 
PTRE2, PTRE2-LUC. PTK-HYG (Clontech, Palo Alto CA). RECAP may be expressed using (i) a 
constitutively active promoter, (e,g., from cytomegalovirus (CMV), Rous sarcoma virus (RSV), SV40 
virus, thymidine kinase (TK), or P-actin genes), (ii) an inducible promoter (e.g., the tetracycline-regulated 
promoter (Gossot, M. and H. Bujard (1992) Proc. Natl. Acad Sci. USA 89:5547-5551; Goss^ M. et 
al, (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451- 
456), commercially available in the T-REX plasmid (Invitrogen)); the ecdysone-inducible promoter 
(available in the plasmids PVGRXR and PIND; Invitrogen); the FK506/rapamycin inducible promoter; 
or the RU486/mifqpristone inducible promoter (Rossi, F.M.V. and H.M. Blau, supra) ), or (iii) a tissue- 
specific promoter or the native promote of the ^dog^ous gene encoding RECAP from a normal 
individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogm) allow one with cffdinary skill in the art to deliver 
polynucleotides to targ^ cells in culture and require minimal effort to optimize expaimental paramet^s. 
In the altanative^ transformation is performed using the calcium phosphate method (Graham, F.L. and 
A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. (1982) EMBO J. 
1 :841 -845). The introduction of DNA to primary cells requires modification of these standardized 
mammalian transfection protocols. 

In anoth^ ^nbodiment of the invention, diseases or disorders caused by g^ietic defects with 
respect to RECAP expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding RECAP under the control of an independent promote or the retrovirus long 
terminal repeat (LTR) promoter, (ii) ^jprc^riate RNA packaging signals, and (iii) a Rev-responsive 
Ciiemsat (RRE) along with additional retrovirus ci5*acting RNA sequraces and coding sequences required 
for efficient vector prc^agation. R^ovirus vectcffs (e.g., PFB and PFBNEO) are conunercially available 
(Stratagene) and are based on published data (Riviere, I. ei al. (1995) Proc. Natl. Acad. Sci. USA 
92:6733-6737), incorporated by reference herein. The vector is propagated in an appropriate vector 
producing cell line (VPCL) that expresses an envelope gene with a tropism for rec^tors on the target 
cells or a promiscuous envelope protdn such as VSVg (Armentano, D. et al. (1987) J. Virol. 61 :1647- 
1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and A.D. Miller (1988) J. Virol. 
62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et al. (1998) J. Virol. 72:9873- 
9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining retrovirus packaging cell lines 
producing high transducing efficiency reU'oviral supmiatant") discloses a method for obtaining retrovirus 
packaging cell lines and is h^eby incorporated by reference. Propagation of retrovirus vectors, 
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transduction of a population of c^Ls (e,g., CD4* T-cells), and the return of transduced cells to a patient 
are procedures well known to pa-sons skilled in the art of gene therapy and have been well documented 
(Ranga. U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, 
M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad Sci. USA 95:1201-1206; 

5 Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding RECAP to cdls which have one or more genetic abnormalities with respect to 
the expression of RECAP. The construction and packaging of adenovirus-based vectors are wdl known 
to those with OTdinary skill in the art. Replication defective adenovirus vectors have proven to be 

10 versatile for importing genes «icoding immunoregulatory proteins into intact islets in the pancreas (Csete, 
M.E. et al. (1995) Transplantation 27:263-268). Potentially useixil adenoviral vectors are described in 
U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors tor goie therapy"), hweby 
incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. 
Nutr. 19:511-544; and Va-ma, LM. andN. Somia (1997) Nature 18:389:239-242, both incorporated by 

15 reference ha'dn. 

In anothar alternative, a herpes-based, gene thwapy delivery system is used to deliver 
polynucleotides ^coding RECAP to target cdls which have one or more genetic abnormalities with 
respect to the expression of RECAP. The use of hopes simplex virus (HS V)-based veacars may be 
especially valuable for introducing RECAP to cells of the central uctvous system, for which HSV has a 

20 tropism. The construction and packaging of herpes-based veaors are well known to those with ordinary 
skill in the art. A rq)lication-competent herpes simplex virus (HSV) type 1 -based wector has been used 
to ddivCT a repCMter grae to the eyes of primates (Uu, X. et al. (1999) Exp. Eye Res. 169:385-395). The 
construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent NumbCT 5,804,413 
to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby incorporated by reference. 

25 U.S. Patent Number 5,804,413 teaches the use of recombinant HSV d92 which consists of a genome 
containing at least one exogenous gene to be transfm-ed to a cell undar the control of the appropriate 
promoter for purposes including human gene therapy. Also taught by this patent are the construction and 
use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22, For HSV vectors, see also Coins. 
W.F. et al. (1999) J. Virol. 73:519-532 and Xu. H. et al. (1994) Dev. Biol. 163:152-161, hereby 

30 incorporated by reference. The manipulation of cloned h^esvirus sequences, the generation of 
recombinant virus following the transfection of multiple plasmids containing different segments of the 
large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with 
herpesvirus are techniques well known to those of ordinary skill in the art. 

In anotho- alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 

35 deliver polynucleotides encoding RECAP to target cells. The biology of the prototypic alphavirus, 
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Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on 
the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus 
RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This 
subgenomic RNA replicates to higho" levels than the full-length genomic RNA, resulting in the 

5 ovCTproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease and 
polymerase). Similarly, inserting the coding sequence for RECAP into the alphavirus genome in place of 
the capsid-coding region results in the production of a large numbo- of REC AP-coding RNAs and the 
synthesis of high levels of RECAP in veaor transduced cells. While alphavirus infection is typically 
associated with cell lysis within a few days, the ability to establish a po-sistent infection in hamster 

10 normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic repUcation of 
alphaviruses can be altored to suit the needs of the gene thwapy application (Dryga, S.A. et aL (1997) 
Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of RECAP into a 
variety of cell types. The specific transduction of a subset of cells in a population may require the sorting 
of cells prior to transduction. The methods of manipulating infectious cDNA clones of alphaviruses, 

15 p^orming alphavirus cDNA and RNA transfections, and performing alphavirus infections, are wdl 
known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expressioa Similarly, inhibition can be 
achieved using triple h^x base-pairing methodology. Triple helix pairing is useful because it causes 

20 inhibition of the ability of the double hdix to open sufficientiy for the binding of polymerases. 

transcription factors, or regulatOTy molecules. Recent tho-apeutic advances using triplex DNA have bera 
described in tiie literature. (See, e.g.. Gee. J.E. et al. (1994) in Hubo^, B.E. and B.I. Carr, Molecular and 
Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177.) A conqjlementary sequence 
or antisense molecule may also be designed to block translation of mRNA by preventing the transcript 

25 from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequoice-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, engineered 
hammerhead motif ribozyme molecules may specifically and efficientiy catalyze endonucleolytic cleavage 

30 of sequences encoding RECAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning die target molecule for ribozyme cleavage sites, including the following sequences: GUA, GUU, 
and cue. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, corresponding to 
the region of the target gene containing the cleavage site, may be evaluated for secondary structural 

35 features which may render the oligonucleotide inoperable. The suitability of candidate targets may also 
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be evaluated by testing accessibility to hybridization with complementary oligonucleotides using 
ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prq^ared by 
any method known in the art for the synthesis of nucldc acid molecules. These include techniques for 
5 chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Altffnativdy, RNA molecules may be generated by in vitro and in vivo transcription of DN A sequences 
encoding RECAP. Such DNA sequences may be incorporated into a wide vari^ of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constitutively or inducibly, can be, introduced into cell lines, cells, or 
10 tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' ends of 
the molecule, (h- the use of phosphorothioate or 2' O-methyl rath^ than phosphodiesterase linkages within 
the backbone of the molecule. This conc^t is inherent in the production of PN As and can be extended in 

15 all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, 
as wdl as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and 
uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding RECAP. Compounds 

20 which may be effective in alt^ng ex]>ression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other po])^ptide transcriptional regulators, and non-macromolecular chemical 
entities which are capable of interacting with specific polynucleotide sequences. Effective compounds 
may alter polynucleotide expression by acting as either inhibitors or promoters of polynucleotide 

25 expression. Thus, in the treatment of disorders associated with increased RECAP expression or 
activity, a compound which specifically inhibits expression of the polynucleotide encoding RECAP 
may be therapeutically useful, and in the treament of disorders associated with decreased RECAP 
expression or activity, a compound which specifically promotes expression of the polynucleotide 
encoding RECAP may be therapeutically useful. 

30 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound based 

35 on chemical and/or structural properties of the target polynucleotide; and selection from a library of 

45 



SUBSTITUTE SHEET (RULE 26) 



WO 01/07612 PCT/USOO/20035 

chemical compounds created combinatoiiaily or randomly. A sample comprising a polynucleotide 
encoding RECAP is exposed to at least one test compound thus obtained. Hie sample may comprise, 
for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted biochemical system. 
Alterations in the expression of a polynucleotide encoding RECAP are assayed by any method 
5 commonly known in the art. Typically, the expression of a specific nucleotide is detected by 
hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding RECAP. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Diction of a change in the expression of a polynucleotide exposed to a test 
10 compound indicates that the test compound is effective in altering the expression of the polynucleotide. 
A screen for a compound effective in altering expression of a specific polynucleotide can be carried 
out, for example, using a Schizosaccharomvces pombe gene expression system (Atkins, D. et al. (1999) 
U.S. Patent No. 5,932,435; Amdt, G.M. et al. (2000) Nucleic Acids Res. 28£15) or a human cell Une 
^ such as HeLa cell (Clarke, MJ-. et al. (2000) Biochem. Biophys, Res. Commun. 268:8-13). A 
W 15 particular embodiment of the present invention involves screening a combinatorial library of 
, n oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et al. 

T (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

if-' 

r: Many methods for inu-oducing vectors into cells or tissues are available and equally suitable for 

|«* 20 use in vivo , in vitro , and ex vivo . For ex vivo th^apy , vectors may be introduced into stem cells taken 

m 

from the patient and clonally propagated for autologous transplant back into that same pati^t. Delivery 
iU by transfection, by liposome injectioas, or by polycationic amino polymers may be achieved using 
methods which are wdOi known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. Biotechnol. 
15:462-466.) 

25 Any of the therapeutic methods described above may be applied to any subject in need of such 

therapy, including, for sample, manunals such as humans, dogs, cats, cows, Imses, rabbits, and 
monkeys. 

An additional embodiment of die invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutically 
30 acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington*s Pharmaceutical Sciences (Maack Publishing, Easton PA). Such pharmaceutical 
compositions may consist of RECAP, antibodies to RECAP, and mimetics, agonists, antagonists, or 
inhibitors of RECAP. 

35 The pharmaceutical compositions utilized in this invention may be administered by any numb^ 
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ot routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, 
topical, sublingual, or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 

5 powder form. These compositions are generally aerosolized immediatdy prior to inhalation by the 
pati^t. in the case of small molecule (e.g, traditional low molecular weight organic drugs), ao^osol 
delivery of fast-acting formulations is well-known in the art. In the case of maaomolecules (e.g. larger 
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar region of 
the lung have enabled the practical ddivary of drugs such as insulin to blood circulation (see, e.g., Patton, 

10 J.S. et al.» U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without 
needle injection, and obviates the need fcH* potentially toxic penetration enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein the 
active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

15 Specialized forms of pharmaceutical compositioas may be prepared for direct intracellular 

delivCTy of macromolecules comprising RECAP or fragments thaeof. For example, liposome 
preparations containing a cell-impermeable macromolecule may promote cell fusion and inu^acellular 
delivery of the macromolecule. Alternatively, RECAP or a fragment thereof may be joined to a short 
cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found 

20 to transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. a 
al. (1999) Science 285:1569-1572). 

For any compound, the th^apeuUcally effective dose can be estimated initially eitha in cell 
culture assays, e.g., of necplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, or 
pigs. An animal modd may also be used to determine the appropriate conc^tration range and route of 

25 administration Such information can then be used to determine useful doses and routes for 
administration in humans. 

A tho'apeutically effective dose refers to that amount of active ingredient, for example RECAP 
or fragments thereof, antibodies of RECAP, and agonists, antagonists or inhibitcx-s of RECAP, which 
ameliorates the symptoms or conditioa Therapeutic efficacy and toxicity may be determined by standard 

30 pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating the ED50 
(the dose therapeutically effective in 50% of the population) or LD50 (the dose lethal to 50% of the 
population) statistics. The dose ratio of toxic to therapeutic effects is the therapeutic index, which can be 
expressed as the LDst/EDjo ratio. Pharmaceutical compositions which exhibit large therapeutic indices 
are preferred. The data obtained from cell culture assays and animal studies are used to formulate a 

35 range of dosage for human use. The dosage contained in such compositioas is preferably within a range 
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quantify gene expression in biopsied tissues in which expression of RECAP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
RECAP, and to monitor regulation of RECAP levels during thaapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

5 sequences, including genomic sequences, encoding RECAP or closely related molecules may be used to 
idaitify nucleic acid sequences vMch encode RECAP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., tiie 5* regulatory region, or from a less specific region, eg., a 
conserved motif, and the stringency of the hybridization or amplification will determine wh^ha* the prcbt 
identifies (My naturally occurring sequences encoding RECAP, allelic variants, or related sequences. 

10 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the RECAP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:23-44 or from 
genomic sequences including promoters, enhancers, and introns of the RECAP gene. 

Meaas for producing specific hybridization probes for DNAs flooding RECAP include the 

15 cloning of polynucleotide sequ^ces encoding RECAP or RECAP derivatives into vectors for the 

production of mRNA probes. Such vectws are known in the art, are conmiCTCially available, and may be 
used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases 
and the apprqwiate labded nucleotides. Hybridization probes may be labeled by a variety of reporter 
groups, for example, by radionuclides such as ^^P or ^^S, or by enzymatic labels, such as alkaline 

20 phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding RECAP may be used for the diagnosis of disord^s associated 
with expression of RECAP. Examples of such disordCTS include, but are not limited to, a neurological 
disorder such as epilepsy, ischemic cerebrovascular disease, suroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and othear extrapyramidal 

25 disorders, Down's syndrome, amyotrophic lateral sclerosis and other motor neuron disorders, 
progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and 
other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidurjQ 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central n^vous system 
disease; prion diseases including kuru, CreutzfeldtJakob disease, and Gerstmann-Straussler-Scheinker 

30 syndrome; fatal familial insomnia, nutritional and metabolic diseases of the nervous system, 

neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal 
syndrome, mental retardation and other developmental disorders of the central nervous system, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 

35 dermatomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies; myasthenia 
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gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; 
seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degen^ation, and familial frontotemporal dementia; an immunological 
5 disorder, including autoimmune/inflammatory disord^s, such as acquired immunodeficiency syndrome 
(AIDS), X-linked agammaglobinemia of Bruton, common variable immunodeficiency (CVI), 
DiGeorge's syndrome (thymic hypoplasia), thymic dysplasia, isolated IgA deficiency, severe combined 
immunodeficiency disease (SCID), immunodeficiency with thrombocytopenia and eczema (Wiskott- 
Aldrich syndrome). Chediak-Higashi syndrome, chronic granulomatous diseases, hereditary 
10 angioneurotic edema, and immunodeficiency associated with Gushing' s disease, Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atheroscla-osis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
1^ polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
C3 dmnatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 
ix.! 15 lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
^ glomerulonq)hritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's thyroiditis, 
O hypea-eosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
^ pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
O syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 

20 lupus arylhematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
03 syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 

n 

'^1 fungal, parasitic, protozoal, and helminthic infections, hematopoietic cancers, including lymphoma, 
leukemia, and myeloma, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), 

25 myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 

thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, tCTatocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, 
brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, 
ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 

30 uterus. The polynucleotide sequences encoding RECAP may be used in Southern or northern analysis, 
dot blot, or other membrane-based technologies; in PGR technologies; in dipstick, pin, and multiformat 
ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to detect altered RECAP 
expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding RECAP may be useful in assays that 

35 detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
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sequences encoding RECAP may be labeled by standard methods and added to a fluid or tissue sample 
from a patient undor conditions suitable for the formation of hybridization complexes. After a suitable 
incubation p^iod, the sample is washed and the signal is quantified and compared with a standard value. 
If the amount of signal in the patient sample is signiHcantly altered in comparison to a control sanq>le 
5 then the presence of altered levds of nuclec^de sequences encoding RECAP in the sample indicates the 
presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a particular 
therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an 
' individual patient. 

In oTiler to provide a basis for the diagnosis of a disorder associated with expression of REG AP» 
10 a normal or standard profile for expression is established. This may be accomplished by combining body 
fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a fragmoit 
th^eof, encoding RECAP, under conditions suitable for hybridization or amplification. Standard 

hybridization may be quantified by conq>aring the values obtained from normal subjects with values firom 

O 

hj an experiment in which a known amount of a substantially purified pol3mucleotide is used. Standard 

i: - 

15 values obtained in this mannar may be conqpared with values obtained fi*om sanQ>les from patients who 
are symptomatic for a disorder. Deviation from standard values is used to establish the presence of a 
disorda*. 

p Once the presence of a disorder is established and a treatment protocol is initiated, hybridization 

assays may be rq)eated on a regular basis to determine if the levd of expression in the patiait begins to 

01 



Q 



20 approximate that which is observed in the normal subject. The results obtained fi^om successive assays 
may be used to show the efficacy of treatment over a period ranging from several days to months. 

With respect to canc^, the presence of an abnormal amount of transcript (either und^- or 
ovorexpressed) in biopsied tissue from an individual may indicate a predisposition fc»' the development of 
the disease, or may provide a means for detecting the disease prior to the appearance of actual clinical 
25 symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby prevmting the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding RECAP 
may involve the use of PCR. These oligom^s may be chemically synthesized, gen^ated ^izymatically, 
30 or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding 

RECAP, or a fragment of a polynucleotide complementary to the polynucleotide mcoding RECAP, and 
will be ^ployed und^ q)timized conditions for identification of a specific gene or condition. Oligom^s 
may also be employed under less stringent conditions for detection or quantification of closely related 
DNA or RNA sequences. 

35 In a particular aspect, oligonucleotide primers daived from the polynucleotide sequences 
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encoding RECAP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions^ insertions and deletions that are a frequent cause of inherited or acquired genetic disease in 
humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
5 dmved from the polynucleotide sequ^ces encoding RECAP are used to amplify DN A using the 
polymerase chain reaction (PGR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause diffwences in the secondary 
and tatiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primes are fluorescently 
10 labded, which allows d^ection of the amplimers in high-througlq>ut equipment such as DNA sequencing 
machines. Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable 
of identifying polymorphisms by comparing the sequence of individual overlapping DNA fragments 
which assemble into a common consensus sequence. These computa'-based methods filter out sequence 
5 variations due to laboratory prq>aration of DNA and sequencing errors using statistical models and 
yj 15 automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
J charactoized by mass spectrom^ry using, for example, the high througtq)ut MASS ARRAY system 
C3 (Sequaiom, Inc., San Diego CA). 

5 Methods which may also be used to quantify the expression of RECAP include radiolabeling or 

p biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

U 20 standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 

^ al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 

fy accelerated by running the assay in a high-througlqjut format wha-e the oligomer or polynucleoticte of 

interest is presented in various dilutions and a spectrc^hotometric or colOTimetric respoase gives rapid 

quantitation. 

25 In furtho" embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotide sequences described herein may be used as elements on a microarray. The microarray can 
be used in transcript imaging techniques which monitor the relative expression lev^s of large numb^s of 
genes simultaneously as described in Seilhamer, J. J. et al., "Comparative Gene Transcript Analysis," 
U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be used to 

30 identify genetic variants, mutations, and polymorphisms. This information may be used to determine 
gene function, to understand the genetic basis of a disordCT, to diagnose a disorder, to monitor 
progression/regression of disease as a frinction of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used to 
develop a pharmacogenomic profile of a patient in ordCT to select the most appropriate and effective 

35 treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
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display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, antibodies spcxnfic for RECAP, or RECAP or fragments thereof may be 
used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target impactions, and gene expression profiles, as described above. 
5 A particular mbodiment relates to the use of the polynucleotides of the present invention to 

generate a transcript image of a tissue or cdl type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression pattens are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 
givai time. (See Sdlham^ et al., "Comparative Gene Transcript Analysis," U.S. Patent Numba 
10 5,840,484, expressly incorporated by refCTence herein.) Thus a transcript image may be gen^ated by 
hybridizing the polynucleotides of the presm invention or their complements to the totality of transcripts 
or reverse transcripts of a particular tissue or cell type. In one embodimmt, the hybridization takes place 
in high-throughput format, wherdn the polynucleotides of the present inv^tion or their conq^lements 
comprise a subset of a plurality of elements on a microarray. The resultant transcript image would 
15 provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflea gene expression in vivo , as in the case 
of a tissue or bic^sy sample, or in vitro , as in the case of a cell line. 

Transcript images which proHle the ^pression of the polynucleotides of the present invention 
20 may also be used in conjuncticm with in vitro model systems and preclinical evaluation of 
G pharmacaiticals, as well as toxicological testing of industrial and naturally-occurring aivironmental 

compounds. All compounds induce characteristic gene expression patterns, frequently termed molecular 
fmgorprints or toxicant signatures, which are indicative of mechanisms of action and toxicity G^uwaysir, 
E.F. al. (1999) Mol. Carcinog. 24:153-159; Steins, S. and N.L. Anderson (2000) Toxicol. Lett. 112- 
25 11 3:467-471, expressly incorporated by refo'ence ha'ein). If a test compound has a signature similar to 
that of a compound with known toxicity, it is likely to share those toxic propaties. These fingerprints or 
signatures are most useiul and refined when they contain expression information from a large number of 
genes and gene families. Ideally, a genome-wide measurement of expression provides the highest quality 
signature. Even genes whose expression is not altered by any tested compounds are important as well, as 
30 the levels of expression of these genes are used to normalize the rest of the expression data. The 
normalization procedure is useful for comparison of expression data after treatment with difterent 
compounds. While the assignment of gene function to elements of a toxicant signature aids in 
interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical 
matching of signatures which leads to prediction of toxicity. (See, for example. Press Release (X)-02 
35 from the National Institute of Environmental Health Sciences, released February 29, 2(X)0, available at 
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http://www.niehs.nih.gov/oc/news/toxchip.htm,) Therefore, it is important and desirable in toxicological 
screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by ueating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
5 biological sample are hybridized with one or more probes specific to the polynucleotides of the present 
invention, so that transcript levels corresponding to the polynucleotides of the present invention may be 
quantified. The transcript levels in the treated biological sample are compared with levels in an 
untreated biological sample. Differences in the transcript levels between the two samples are indicative 
of a toxic response caused by the test compound in the treated sample. 
10 AnothCT particular embodiment relates to the use of the polypeptide sequences of the present 

invention to analyze the proteome of a tissue or cell type. The tsnn proteome refas to tiie global pattern 
of protdn expression in a particular tissue or cell type. Each protein component of a proteome can be 
subjected individually to further analysis. Proteome expression patterns, or profiles, are analyzed by 
quantifying tiie number of expressed proteins and thdr relative abundance under given conditions and at a 
15 given time. Aprofileof a cell's proteome may thus be g^^ated by sq>arating and analyzing the 

polypeptides of a particular tissue or cell type. In one onbodiment, the separation is achieved using two- 
dimensional gel electrophoresis, in which proteins from a sample are sq>arated by isoelectric focusing in 
the first dimension, and thai according to molecular weight by sodium dodecyl sulfate slab gel 
electrophoresis in ttie second dimension (Stdnar and Anderson, supra ). The protdns are visuaUzed in the 
20 gel as discrete and uniquely positioned spots, typically by staining the gd with an agent such as 
Coomassie Blue or silver or fluorescent stains. The optical dMSity of each protdn spot is generally 
proportional to tiie level of the protein in the sample. The optical densities of equivalenfly positioned 
protdn spots from diffea-ent samples, for example, from biological samples dther treated or untreated 
with a test compound or tha-apeutic agent, are compared to identify any changes in protdn spot density 
25 related to the treatment. The protdns in the spots are partially sequenced using, for example, standard 
metiiods employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of ttie 
protdn in a spot may be det^mined by comparing its partial sequence, preferably of at least 5 contiguous 
amino acid residues, to the polyp€?)tide sequences of the present inventioiL In some cases, furtiier 
sequence data may be obtained for definitive protein identification. 
30 A proteomic profile may also be generated using antibodies spedfic for RECAP to quantify the 

levels of RECAP expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to Uie sample and detecting ttie 
levels of protein bound to each array element (Luddng, A. et al. (1999) Anal. Biochem. 270:103-1 1 1 ; 
Mendoze. L.G. et al. (1999) Biotechniques 27:778-788). Detection miay be performed by a variety of 
35 metiiods known in the art, for sample, by reacting tiie proteins in tiie sample witii a tiiiol- or amino- 
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reactive fluorescent compound and detecting the amount of fluorescence bound at each array dement. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson. N.L. and J. 
5 Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difflcult, due to rapid 
degradation of mRN A, so proteomic proflling may be more rdiable and informative in such cases. 
In anoth^ embodiment, the toxicity of a test compound is assessed by treating a biological 

10 sample containing protdns with the test compound. Proteins that are expressed in the treated biological 
sanq^le are separated so that the amount of each protein can be quantified. The amount of each protdn is 
compared to the amount of the ccnesponding protein in an untreated biological sample. A difference in 
the amount of protdn between the two samples is indicative of a toxic response to the test compound in 
the treated sample. Individual protdns are identified by sequencing the amino acid residues of the 

15 individual protdns and con^aring these partial sequences to the polyp^tides of the present invention. 
In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sanq)le containing proteins with the test compound. Protdns from the biological sample are incubated 
with antibodies specific to the polypq)tides of the present invention. The amount of protdn recognized 
by the antibodies is quantified. The amount of protdn in the treated biological sample is compared with 

20 the amount in an untreated biological sample. A diffearrace in the amount of protdn t>etween the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prq)ared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Sch^a, M. et al. (1996) Proc, Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschwdler et al. (1995) PCT application W095/251116; Shalon. D, et al. 

25 (1995) PCT application WO95/35505; Heller, R. A et al. (1997) Proc. Nati. Acad. Sci. USA 94:2150- 
2155; and Hello-, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarravs: A Practical Approach . M. Schena, ed. (1999) 
Oxford Univa-sity Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding RECAP may be used to 

30 gmerate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members of 
a multi-gene family may potentially cause undesired cross hybridization during chromosomal mapping. 
The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to 

35 artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial 
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chromosomes (YACs). bact^ial artificial chromosomes (BACs), bactarial PI constructions, or single 
chromosome cDNA libraries. (See, e.g., Harrington. JJ. et al, (1997) Nat. Genet. 15:345-355; Price, 
CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) Once mapped, 
the nucleic acid sequences of the invention may be used to develop genetic linkage maps, for example, 
5 which correlate the inheritance of a disease state with the inheritance of a particular chromosome region 
or restriction fragment length polymorphism (RFLP). (See, e.g.. Lander, E.S. and D. Botstein (1986) 
Proc. Nad. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 
data. (See, e.g.. Heinz-Ulrich, et al. (1995) in Meyers, supra > pp. 965-968.) Examples of genetic map 
10 data can be found in various scientific journals or at the Online M enddian Inheritance in Man (OMIM) 
World Wide Web site. Correlation betwe^ the location of the gene encoding RECAP on a physical map 
and a specific disorder, or a predisposition to a specific disorder, may help define the region of DN A 
^ associated with that disorder and thus may furth^ positional cloning efforts. 

■RUBS? 

Q In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

15 linkage analysis using established chromosomal markers, may be used for extending genetic maps. Often 
the placement of a g^ie on the chromosome of another mammalian species, such as mouse, may reveal 
associated markers even if the exact chromosomal locus is not known. This information is valuable to 
investigators searching for disease genes using positional cloning or other gene discovery techniques. 
Once the gttie or genes responsible for a disease or syndrome have be^ crudely localized by genetic 
20 linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences mapping to 
that area may represent associated or regulatory genes f(x further investigation. (See, e.g., Gatti, R.A. et 
al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invoition may also be used to 
detea differences in the du-omosomal location due to translocation, invo-sion, etc., among normal, 
carrier, or affected individuals, 
25 In anoth^ embodiment of the invration, RECAP, its catalytic or immunogenic Iragm^ts, or 

oligcf)eptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cdll surface, or located intracellularly. The formation of binding complexes 
between RECAP and the agent being tested may be measured. 
30 Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest. (See, e.g., Geysen. et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test confounds are 
synthesized on a solid substrate. The test compounds are reacted with RECAP, or fragments thereof, and 
washed. Bound RECAP is then detected by methods we]] known in the art. Purified RECAP can also be 
35 coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
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non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding RECAP specifically compete with a test compound for binding RECAP. 
In this manner, antibodies can be used to detect the presence of any pq)tide which shares one or more 
5 antigenic determinants with RECAP. 

In additional embodiments, the nucleotide sequences which encode RECAP may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rdy on 
propmies of nucleotide sequences that are currentiy known, including, but not limited to, such propmies 
as the triplet genetic code and specific base pair interactions. 
10 Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utilize the present invention to its fullest extent The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and. below, in 
15 particular U.S. Sen No. 60/145,232, U.S. Ser. No. 60/158,578, and U.S. Ser. No. 60/165.192, are 
hereby expressly incorporated by reference. 

EXAMPLES 

I* Construction of cDNA Libraries 

20 RNA was purchased from Clontech oc isolated from tissues described in Table 4. Some tissues 

were homogenized and lysed in guanidinium isothiocyanate> while oth^s w^e homogenized and lysed in 
phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated from the lysates with dther isoprc^anol or 

25 sodium acetate and ethanol, or by othor routine methods. 

Phenol extraction and precipitation of RNA were rq)eated as necessary to increase RNA purity. 
In some cases, RNA was treated with DNase. For most libraries, poly(A-h) RNA was isolated using 
oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alt^natively, RNA was 

30 isolated directly from tissue lysates using other RNA isolation kits. e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDN A was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 

35 recommended procedures or similar methods known in the art. (See. e.g., Ausubd, 1997, supra , units 
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5.1-6.6.) Reverse transcriplion was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adaptCTS were ligated to double stranded cDNA. and the cDN A was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SIOOO, SEPH AROSE CL2B. oc SEPHAROSE CL4B column chromatography 
5 (AmCTsham Pharmacia Biotech) or preparative agarose gd electrophoresis. cDNAs wctc ligated into 
compatible restriction enzyme sites of the polyUnkCT of a suitable plasmid, e.g.. PBLUESCRIPT plasmid 
(Stratagene), PSPORTl plasmid (Life Technologies), pcDNA2.1 plasmid (Invitrogen, Carlsbad C A), or 
pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant plasmids wea-e transformed into 
competent E. coU cdls including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, 
10 DHIOB, or ElectroMAX DHIOB from Life Technologies. 
U. Isolation of cDNA Clones 

Plasmids obtaiited as described in Example I were recovered from host cdls by in vivo excision 
using the UNIZAP veaor system (Stratagene) or by cell lysis. Plasmids w^e purified using at least one 
of the following: a Magic or WIZARD Minipreps DNA purification system (Promcga); an AGTC 
15 Miniprep purification kit (Edge Biosystems. Gaith^sburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid puriflcation systems or the R.E. A.L. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled 
5 water and stored, with or without lyophilization, at 4**C. 

Alternatively, plasmid DNA was anq[>lified from host cell lysates using direa link PCR in a high- 
20 throughputformat(Rao,V.B. (1994) Anal. Biochem. 216:1-14). Host cdl lysis and thermal cycling 
steps were carried out in a single reaction mixture. Samples were processed and stored in 384-well 
plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluOTescence scanner 
(Labsystems Oy, Helsinki. Finland). 
25 IIL Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example n were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation such 
as the ABI CATALYST 800 (PE BiosysteoLs) thermal cycler or tiie PTC-200 tiiCTmal cyclo- (MJ 
Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or ttie MICROLAB 
30 2200 (Hamilton) hquid transfer system. cDN A sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic separation of 
cDNA sequencing reactions and detection of labeled polynucleotides were carried out using the 
MEGABACE 1000 DNA sequencing systOTi (Molecular Dynamics); the ABI PRISM 373 or 377 
35 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calUng 
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software; or oth^ sequence analysis systems known in the art. Reading frames within the cDN A 
sequences were identified using standard methods (reviewed in Ausubel, 1997, supra , unit 7.7). Some of 
the cDN A sequences ware selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences dorived from cDN A sequencing w^e assembled and analyzed 
5 using a combination of software programs which utilize algorithms well known to those skilled in the art. 
Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
ref^ences, and threshold paramet^s. The first column of Table 5 shows the tools, programs, and 
algorithms used, the second column provides brief desariptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herdn in their entirety, and the fourth 
10 column presents, where applicable, the scores, probability values, and oth^ parameta's used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequaices). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Enginemng, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were gena'ated using the ddault param^ers specified by the clustal 

B_S_S 

M» 15 algorithm as incorporated into the MEGALIGN multisequaice alignmait program (DNASTAR), which 

also calculates the percGCA identity between aligned sequences. 
«P The polynucleotide sequences v/er& validated by removing vector, linker, and polyA sequences 

Q and by masking ambiguous bases, using algorithms and progranLS based on BLAST, dynamic 

p programing, and dinucleotide nearest ndghbor analysis. The sequences were then quoried against a 

Oj 20 selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 

eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The fiill length 
25 polynucleotide sequences were translated to derive the corresponding ftjll l^gth amino acid sequences, 
and these ftili length sequences were subsequently analyzed by querying against databases such as the 
GenBank databases (described above), SwissProt. BLOCKS, PRINTS, DOMO, PRODOM. Prosite, and 
Hidden Markov Modd (HMM)-based protein family databases such as PFAM. HMM is a probabilistic 
approach which analyzes consensus primary structures of gene families. (See, e.g., Eddy, S.R. (1996) 
30 Curr.Opin. Struct. Biol. 6:361-365.) 

The programs described above tor the assembly and analysis of full length polynucleotide and 
amino acid sequences w^e also used to identify polynucleotide sequence fragments from SEQ ID NO:23- 
44. Fragments from about 20 to about 4000 nucleotides which are useftil in hybridization and 
amplification technologies were described in The Invention section above. 
35 rv« Analysis of Polynucleotide Expression 
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Northem analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RN As from a 
particular cdl type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel. 1995, supra, 
ch. 4and 16.) 

5 Analogous computer techniques applying BLAST were used to search for identical or related 

molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much 
fast^ than multiple membrane-based hybridizations. In addition, the sensitivity of the compute: search 
can be modified to determine whether any particular match is categorized as exact or similar. The basis 
of the search is the product score, which is defined as: 

10 BLAST Score x Percent Identity 

5 X minimum {length(Seq. 1), length(Seq. 2)} 



^7 



The product score takes into account both the degree of similarity between two sequences and the length 
of the sequOTce match. The product score is a normalized value between 0 and 100, and is calculated as 

15 follows: the BLAST score is multiplied by the parcent nucleotide id^itity and the produa is divided by 
(5 times the length of the shorter of the two sequeaices). The BLAST score is calculated by assigning a 
score of +5 for evCTy base that matches in a high-scoring segment pair (HSP), and -4 for every mismatch. 
Two sequences may share more than one HSP (s^arated by gaps). If there is more than one HSP, then 
the pair with the highest BLAST score is used to calculate the product score. The product score 

20 rq)resents a balance b^een fractional ovarlap and quality in a BLAST alignment. For example, a 
product score of 100 is produced only for 100% identity over the entire Imgth of the slKMter of the two 
sequences being compared. A product score of 70 is produced eithar by 100% identity and 70% overlap 
at one end, or by 88% identity and 100% overlap at the othCT. A product score of 50 is produced eitiiar 
by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

25 The results of northan analyses are reported as a p^centage distribution of libraries in which the 

traascript encoding RECAP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, radocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflanmiation, trauma, cdl 

30 prolif^ation, neurological, and pooled. For each category, the numb^ of Ubraries expressing the 
sequence of into-est was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 3. 
V. Chromosomal Mapping of RECAP Encoding Polynucleotides 

The cDN A sequences which were used to assemble SEQ ID NO:23-44 were compared with 

35 sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
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implementations of the Smith- Watennan algorithm. Sequences from these databases that matched SEQ 
ID NO:23-44 were assembled into clusters of contiguous and overlapping sequences using assembly 
algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available from public 
resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome 
'I 5 Research (WIGR), and G6n6thon were used to determine if any of the clust^ed sequences had been 
previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment of all 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Genetic map locations are reported as ranges, or intervals, of human chromosomes. 
The map position of an int^val, in centiMorgans, is measured relative to the terminus of the 
10 chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 
S radiation hybrid mark^s whose sequences were included in each of the clusters. SEQ ID NO:24 maps 

15 to chromosome 1 within the interval from 12.8 to 22.9 centiMorgans. SEQ ID NO:36 maps to 
chromosome 1 within the interval from 74.8 to 78.3 centiMorgans. 
^ VI. Extension of RECAP Encoding Polynucleotides 

^ The full length nucleic acid sequences of SEQ ID NO;23-44 wctc produced by extension of an 

appropriate fragment of the full Iragth molecule using oligonucleotide primers designed from this 

i ! 3 

p 20 fragment. One primer was synthesized to initiate 5' ext^ision of the known fragment, and the oth^ 
P primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 

4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68 "'C to about 72 ""C. Any stretch of nucleotides which would result in hairpin 
25 structures and primCT-primo* dimerizations was avoided. 

Selected human cDN A libraries w^e used to extend the sequence. If mcxc than one extension 
was necessary or desired, additional or nested sets of primers were designed 

High flddity amplification was obtained by PGR using methods wdl known in the art. PGR was 
performed in 96-wdl plates using the PTG-200 thermal cycl«- (MJ Research, Inc.). The reaction mix 
30 contained DNA template, 200 nmol of each primer, reaction buffer containing Mg^*, (NH4)2S04. and p- 
mercaptoethanol, Taq DNA polyma-ase (Am^sham Pharmacia Biotech), ELONG ASE enzyme (Life 
Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for prima* pair PCI 
A and PGI B: Step 1: 94**G, 3 min; Step 2: 94 '^C, 15 sec; Step 3: 60*G, 1 min; Step 4: 68**G. 2 min; 
Step 5: Steps 2, 3, and 4 rq)eated 20 times; Step 6: 68 **G, 5 min; Step 7: storage at 4**G. In the 
35 alternative, the paramet^s for prima* pair T7 and SK+ wae as follows: St^ 1 : 94**G, 3 min; Step 2: 

61 



SUBSTITUTE SHEET (RULE 26) 



wo 01/07612 PCT/USOO/20035 

94^C, 15 sec; Step 3: 57^C, 1 min; Stq) 4: 68**C, 2 min; Step 5: Steps 2. 3, and 4 repeated 20 times; 
Stqp 6: 68**C, 5 min; Step 7: storage at 4'*C. 

The concentration of DNA in each well was detca-mined by dispensing 100 pi PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE and 

5 0.5 jil of undiluted PGR product into each well of an opaque fluorimet«- plate (Corning Costar, Acton 
MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II (Labsystems 
Oy, Helsinki, Finland) to measure the fluoresc^e of the sample and to quantify the concentration of 
DNA. A 5 A^l to 10 aliquot of the reaction mixture was analyzed by electrophoresis on a 1 % agarose 
mini-gel to determine which reactions were successful in extending the sequence. 

10 The extaided nucleotides were desalted and concentrated, transferred to 384-well plates, digested 

with Cvi JI cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or 
sheared prior to religation into pUC 1 8 vectCM: (Ama-sham Pharmacia Biotech). For shotgun sequmdng, 
the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, fragments were 
excised, and agar digested with Agar ACE (Promega). Extended clones vrerc religated using T4 Ugase 

15 (New England Biolabs, BevCTly MA) into pUC 1 8 vector (Amersham Pharmacia Biotech), treated with 
Pfij DNA polymerase (Stratagene) to filWn restriction site overhangs, and transfected into compet^t E, 
coli cells. Transformed cdls were selected on antibiotic-containing media, and individual colonies ware 
picked and cultured overnight at 37**C in 384-well plates in LB/2x carb liquid media. 

The cells w^elysed, and DNA was amplified by PGR using Taq DNA polymerase (Am^sham 

20 Pharmacia Biotech) and Pfti DNA polym^ase (Stratagene) with the following parameters: St^ 1 : 94 '^C, 
3 min; Step 2: 94**C, 15 sec; Stq> 3: 60**G, 1 min; Step 4: 72**C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72**G, 5 min; St^ 7: storage at 4"C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples widi low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 

25 v/v), and sequenced using DYEN AMIC energy transfer sequencing primers and the DYENAMIC 

DIRECT kit (AmCTSham Pharmacia Biotech) or ttie ABI PRISM BIGDYE Terminator cycle sequmcing 
ready reaction kit (PE Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:23-44 are used to obtain 5* 
regulatory sequences using Uie procedure above, along with oligonucleotides designed for such extension, 

30 and an appropriate genomic library. 

VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:23-44 are employed to screen cDNAs, genomic 
DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 

35 Oligonucleotides are designed using staie-of-the-art software such as OLIGO 4.06 software (National 
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Bioscimces) and labeled by combining 50 pmol of each oligomer, 250 fiCi of [y-^^P] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston MA), 
The labeled oligonucleotides are substantially puriHed using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Am^sham Pharmacia Biotech). An aliquot containing 10^ counts p^ 
5 minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 
Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transfm-ed to nylon 
membranes (Nytran Plus, ScMeich^ & Schuell, Durham NH). Hybridization is carried out for 16 hours 
10 at40*'C. To rmiove nonspecific signals, blots are sequentially washed at room tenq)^ature under 
conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

15 The linkage or synthesis of array dements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing. See, e.g.. Baldeschwdler, sut>ra) . mechanical 
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alt^natively, a procedure 
20 analogous to a dot or slot blot may also be used to arrange and link dements to the surface of a substrate 

'•.rs 

using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced using 

ssrV 

IJ available methods and machines wdl known to those of ordinary skill in the art and may contain any 

appropriate numbear of demits. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et 
al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat Biotechnol. 16:27-31.) 

25 Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 

comprise the elements of the microarray. Fragments or oligom^s suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
dements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent labd or othar molecular tag for ease of detection. After 

30 hybridization, nonhybridized nucleotides from the biological sample are removed, and a fluorescence 
scanner is used to detect hybridization at each array dOTient. Altemativdy, lasCT desorbtion and mass 
spectrometry may be used for detection of hybridization. The degree of complementarity and the relative 
abundance of each polynucleotide which hybridizes to an element on the microarray may be assessed. In 
one embodiment, microarray preparation and usage is described in detail below. 

35 Tissue or Cell Sample Preparation 

63 



SUBSTITUTE SHEET (RULE 26) 



wo 01/07612 



PCT/USOO/20035 



Li. 
IS 



Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A)* RNA is purified using the oligo-(dT) cellulose method. Each poIy(A)* RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pl oligo-(dT) primer (21mCT), IX first 
strand buffer. 0.03 units/pl RNase inhibitor, 500 \M dATP, 500 dGTP, 500 dTTP, 40 pM 
5 dCTP, 40 mM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng j)oly(A)* RNA with 
GEMBRIGHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 *C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
10 incubated for 20 minutes at 85 *C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH). Palo Alto C A) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml)» 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended 
15 in 14 Ml 5X SSaO.2% SDS. 
Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PGR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
20 amplified in thirty cycles of PGR from an initial quantity of 1-2 ng to a final quantity greater than 5 ^g. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 
I Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 
25 Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and coated 
with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 
concentration of 100 ng/jil, is loaded into the open capillary printing element by a high-speed robotic 
30 apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60**C followed by washes in 0.2% 
35 SDS and distilled water as before. 
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Hybridization 

Hybridization reactions contain 9 jil of sample mixture consisting of 0.2 |ig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65 **C for 5 minutes and is aliquoted onto the microarray surface and covered with 

5 an 1 .8 cm^ coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger tfian a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 
^ of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 6.5 
hours at 60X. The arrays are washed for 10 min at 45 **C in a first wash buffer (IX SSC, 0.1% SDS), 
three times for 10 minutes each at 45 ^'C in a second wash buffer (O.IX SSC), and dried. 

10 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the airay using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

15 containing the array is placed on a computer-controlled X-Y stage on the microscope and rastw- 
scanned past the objective. The 1 .8 cm x 1 ,8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

20 Hamamatsu Photonics Systems, BridgewatCT NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

25 The sensitivity of the scans is typically calibrated using the signal intensity generated by a 

cDN A control species added to the sample mixture at a known concentration, A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1 : 100,000. When two samples from 
different sources (e.g., representing test and control cells), each labeled with a different fluorophore, are 

30 hybridized to a single array for the purpose of identifying genes that are differentially expressed, the 
calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 

35 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 

65 



SUBSTITUTE SHEET (RULE 26) 



wo 01/07612 



PCT/USOO/20035 



linear 20-color transfonnation to a pseudocolor scale ranging from blue Oow signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) l)etween the fluorophores using each fluorophore's emission spectrum. 
5 A grid is superimposed over the fluorescence signal image such that the signal from each spot 

is centered in each el^ent of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

IX. Complementary Polynucleotides 

10 Sequences complementary to the RECAP-encoding sequences* or any parts thereof, are used to 

detect, decrease, or inhibit expression of naturally occurring RECAP. Although use of oligonucleotides 
comprising from about IS to 30 base pairs is described, essentially the same procedure is used with 
smalls or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 4.06 
software (National Biosci^ces) and the coding sequence of RECAP. To inhibit transcription, a 

15 compl^entary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promote- binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the RECAP-encoding transcript. 

X. Expression of RECAP 

Expression and purification of RECAP is achieved using bact^ial or virus-based expression 

20 systems- For expression of RECAP in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDN A 
transcriptioa Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 baaeriophage promoter in conjunction with the lac operator regulatory 
e]ement. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(D£3). Antibiotic 

25 resistant bacteria express RECAP upon induction with isopropyl b^-D-thiogalactopyranoside (IPTG). 
Expression of RECAP in eukaryotic cdls is achieved by infecting insect or mammalian cell lines with 
recombinant Autograohica californica nuclear polyhedrosis virus ( AcMNPV), commonly known as 
baculovirus. The nonessential ix>lyhedrin gene of baculovinis is rq>laced with cDNA encoding RECAP 
by either homologous recombination or bact^ial-mediated transposition involving o-ansfer plasmid 

30 intermediates. Viral infectivity is maintained and the strong polyhedrin promote* drives high levels of 
cDNA transcriptioa Recombinant baculovirus is used to infect SpodODtera frugiperda fSf9^ insect cells 
in most cases, or human hq^atocytes, in some cases. Infection of the latto* requires additional genetic 
modifications to baculovirus. (See Engelhard. E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224- 
3227; Sandig, V. et al. (1996) Hum. GeneThca:. 7:1937-1945.) 

35 In most expression systems, RECAP is synthesized as a fusion protein with, e.g., glutathione S- 
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transfi^ase (GST) or a peptide qpitope tag, such as FLAG or 6-His, permitting rapid, single-stq>, 
affinity-based purification of recombinant fusion protdn from crude cell lysates. GST, a 26-kilodaUon 
enzyme from Schistosoma iaponicum > enables the purification of hision proteins on immobilized 
glutathione under conditions that maintain protdn activity and antigenicity (Amersham Pharmacia 
5 Biotech). Following purification, the GST moiety can be proteolytically cleaved from RECAP at 

specifically engine^ed sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using 
commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-Hjs, a 
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN). 
Mediods for protdn expression and purification are discussed in Ausubel (199S, supra , ch. 10 and 16). 
10 Purified RECAP obtained by these methods can be used directly in the assays shown in Examples XI and 
XV. 

XL Demonstration of RECAP Activity 

Rec^tcM' activity of RECAP is d^^mined in a ligand-binding assay using candidate ligand 
O molecules in the pres^ce of ^^Mabeled RECAP. RECAP is labded with Bolton-Hunter reagent. 

^ 15 (See, e.g., Bolton, A.E. and W.M. Hunto* (1973) Biochem. J. 133:529-539). Candidate ligand molecules 
previously arrayed in the wells of a muld-wdl plate are incubated with the labeled RECAP, washed, and 

G 

^ any wdls with labeled RECAP conqplex are assayed. Data obtained using different concaitrations of 

1,, RECAP are used to calculate values for the number, affinity, and assodation of RECAP with the ligand 

Q 

molecules. The levd of binding measured is proportional to the level of RECAP activity. 
20 In the alternative, activity of RECAP may be measured using an assay based upon the 

property of some GPCRs to support the in vitro proliferation of fibroblasts and tumor cells under 
SOTJm-flree conditions (Chiquet-Huismann, R, et al. (1986) Cell 47:131-139). Wells in 96 well cluster 
plates (Falcon, Fisher Scientific, Santa Clara CA) are coated with RECAP by incubation with solutions 
at 50*100 Mg/ml for 15 min at ambient temperature. The coating solution is aspirated, and the wells 
25 washed with Dulbecco*s medium before cells are plated. Rat fibroblast cultures or rat mammary tumor 
cells are prepared as described and plated at a density of 10^-10^ cells/ml in Dulbecco's medium 
supplemented with 10% fetal calf sarum (FCS). 

After three days the media are removed, and the cells washed three times with phosphate- 
buffered saline (PBS) before the addition of smim-free Dulbecco's medium containing 0.25 mg/ml 
30 bovine serum albumin (BSA, Fraction V, Sigma Chemical, St. Louis, MO). After 2 days the medium 
is aspirated, and 100 pi of [^H]thymidine (NEN) at 2 |iCi/nil in fresh Dulbecco's medium containing 
0.25 mg/ml BSA added. Parallel plates are fixed and stained to determine cell numbers. After 16 hr, 
the medium is aspirated, the cell lay^ washed with PBS, and the 10% trichloroacetic acid-precipitable 
counts in the cell layer determined by liquid scintillation counting of radioisotope (normalized to 
35 relative cell numbers; Chiquet-Ehrismann, R. et al. (1986) supra). The rates of cell proliferation and 
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[^H]thyinidine uptake are proportional to the activity of RECAP in the sample. 

In the alternative, the assay for RECAP activity is based upon the property of CD97/Emrl 
GPCR family proteins to modulate G protein-activated second messenger signal transduction pathways 
(e.g., cAMP; Gaudin, P., et al. (1998) J. Biol. Chem., 273:4990-4996). A plasmid encoding full length 

5 RECAP is transfected into a mammalian cell line (e.g., COS-7 or Chinese hamster ovary (CHO-Kl) 
cell lines) using methods well-known in the art. Transfected cells are grown in 12- well trays in culture 
medium containing 2% PCS for 48 hours, the culture medium is discarded, then the attached cells are 
gently washed with PBS. The cells are then incubated in culture medium with 10% PCS or 2% PCS 
for 30 minutes, then the medium is removed and cells lysed by treatment with 1 M perchloric acid. 

10 The cAMP levels in the lysate are measured by radioimmunoassay using methods well-known in the 
art. Changes in the levels of cAMP in the lysate from 10% PCS-treated cells compared with those in 
2% PCS-treated cells are proportional to the activity of RECAP present in the transfected cells. 

In another alternative, an assay for RECAP activity is based on a prototypical assay for 
ligand/receptor-mediated modulation of cell proliferation. This assay measures the rate of DN A 

15 synthesis in Swiss mouse 3T3 cells. A plasmid containing polynucleotides encoding RECAP is added to 
quiescent 3T3 cultured cells using translection methods wdl known in the art. The transiently 
transfected cells are then incubated in the presence of [^HJthymidine, a radioactive DNA precursor 
molecule. Varying amounts of RECAP ligand are then added to the cultured cdls. Incorporation of 
pHJthymidine into acid-precipitable DNA is measured ovct an appropriate time interval using a 

20 radioisotope counter, and the amount incorporated is directly propcMtional to the amount of newly 
synthesized DNA. A linear dose-response curve over at least a hundred-fold RECAP ligand 
concentration range is indicative of receptor activity. One unit of activity per millilitCT is defined as the 
concentration of RECAP producing a 50% response level, wh^e 100% represents maximal incOTporation 
of [^H]thymidine into acid-predpitable DNA (McKay, I. and Leigh, I., eds. (1993) Growth Pactors: A 

25 Practical Approach , Oxford Univarsity Press, New York, NY, p. 73,) 

In the alternative, tiie assay for RECAP activity is based upon the abiUty of GPCR family 
proteins to modulate G protein-activated second messenger signal transduction pathways (e.g., cAMP; 
Gaudin, P. et al. (1998) J. Biol. Chem. 273:4990-4996). A plasmid encoding full length RECAP is 
transfected into a mammalian cell line (e.g., Chinese hamster ovary (CHO) or human embryonic kidney 

30 (HEK-293) cell lines) using methods well-known in the art. Transfected cells are grown in 12-well 
trays in culture medium for 48 hours, then the culture medium is discarded, and the attached cells are 
gentty washed with PBS. The cells are then incubated in culture medium with or without ligand for 30 
minutes, then the medium is removed and cells lysed by treatment with 1 M perchloric acid. The 
cAMP levels in the lysate are measured by radioimmunoassay using methods well-known in the art. 

35 Changes in the levels of c AMP in the lysate from cells exposed to ligand compared to those without 
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ligand are proportional to the amount of RECAP present in the transfected cells. 
XII* Functional Assays 

RECAP ftinction is assessed by expressing the sequences encoding RECAP at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
5 vector containing a strong promoter that drives high levels of cDN A expression. Vectors of choice 
include pCMV SPORT (Life Technologies) and pCR3.1 Onvitrogen, Carlsbad CA), both of which 
contain the cytomegalovirus promote, 5-10 ^g of recombinant vector are transiently transfected into a 
human cell line, for example, an endothdial or hanatopoietic cell line, using either liposome formulations 
or electroporation. 1-2 of an additional plasmid containing sequaices encoding a marker proton are 
10 co-transfected. Expression of a marker protein provides a means to distinguish transfected cells from 
nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. MarkCT 
proteins of choice include, e.g.. Green Fluorescent Protein (GFP; Clontech). CD64. or a CD64-GFP 
1^ fusion protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify 

^ transfected cells expressing GFP or CD64-GFP and to evaluate tiie apoptotic state of the cells and other 

O 

y 15 cellular properties. FCM detects and quantifies the uptake of fluorescent molecules that diagnose events 
2 preceding or coincident with cell death. These events include changes in nuclear DN A content as 

O measured by staining of DNA with propidium iodide; changes in cdl size and granularity as measured by 

forward light scatty and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alto-ations in expression of cell surface and intracellular protdns 
20 as measured by reactivity with specific antibodies; and alt^ations in plasma membrane composition as 
2 measured by the bimling of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 

rU flow cytometry are discussed in Qrmerod, M.G. (1994) Flow Cvtometrv . Oxford, New York NY. 

The influOTce of RECAP on gene expression can be assessed using highly purified populations of 
cdLs transfected with sequmces ^coding RECAP and dtiier CD64 or CD64-GFP. CD64 and CD64- 
25 GFP are expressed on the surface of transfected cells and bind to consarved regions of human 

immunoglobulin G (IgG). Transfected cells are efficienfly separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
niRNA can be purified from the cells using methods wdl known by those of skill in the art. Expression 
of mRNA encoding RECAP and othar g^es of into-est can be analyzed by northern analysis oc 
30 microarray techniques. 

XIII- Production of RECAP Specific Antibodies 

RECAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Mettiods Enzymol. 182:488-495), or othCT purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
35 Alternatively, the RECAP amino acid sequence is analyzed using LASERGENE software 
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(DNASTAR) to detCTmine regions of high inununogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate q)itopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (See, e.g., Ausubel, 1995, supra, ch, 11.) 

5 Typically, oligopeptides of about 1 5 residues in laigth are synthesized using an ABI 43 1 A 

peptide synthesizer (PE Biosystans) using FMOC chamstry and coupled to KLH (Sigma- Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide estCT (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995. suECa.) Rabbits are immunized with the oUgq)eptid&.KLH 
complex in complete Freund's adjuvant. Resulting antisora are tested for antipqptide and anti-REC AP 

10 activity by, for example, binding the peptide or RECAP to a substrate, blocking with 1% BSA, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIV. Purification of Naturally Occurring RECAP Using Specific Antibodies 
Naturally occurring or recombinant RECAP is substantially purified by immunoafBnity 

chromatography using antibodies specific for RECAP. An immunoaffinity column is constructed by 
15 covalentty coupling anti-RECAP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (AmCTsham Pharmacia Biotech). After the coupUng, tiie resin is blocked 

and washed according to the manufacturer's instructions. 

Media containing RECAP are passed over the immunoafBnity column, and the column is washed 

under conditions that allow the prefarential absorbance of RECAP (e.g., high ionic strength buffa-s in the 
20 presence of detCTgrat). The column is eluted undor conditions that disrupt antibody/RECAP binding 

(e.g., a buffa: of pH 2 to pH 3, or a high conc€aitration of a chaotrope. such as urea thiocyanate ion), 

and RECAP is collected. 

XV. Identification of Molecules Which interact with RECAP 

RECAP, or biologically active fragmaits thereof, are labeled with Bolton-HuntCT reagent. 
25 (See, e.g., Bolton AE. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 

previously arrayed in the wells of a multi-well plate are incubated with the labded RECAP, washed, and 
any wells with labeled RECAP complex are assayed. Data obtained using different concentrations of 
RECAP are used to calculate values for the numbCT, affinity, and association of RECAP with the 
candidate molecules. 

30 Alternatively, molecules interacting with RECAP are analyzed using the yeast two-hybrid 

system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

RECAP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput mann^ to determine all intCTactions 

35 between the proteias encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
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No. 6,057,101). 



Various modifications and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
5 Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious to 
those skilled in molecular biology or related fidds are intended to be within the scope of the following 
claims. 
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